Overview

Dataset statistics

Number of variables: 83
Number of observations: 89184
Missing cells: 2516072
Missing cells (%): 34.0%
Total size in memory: 57.2 MiB
Average record size in memory: 672.0 B
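
As a quick consistency check, the missing-cell percentage can be recomputed from the other figures in this table: missing cells divided by (variables × observations). The sketch below uses only the numbers reported above; the variable names are illustrative.

```python
# Figures copied from the dataset statistics above.
n_variables = 83
n_observations = 89184
missing_cells = 2516072

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

The result rounds to the 34.0% shown in the report.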

Variable types

Text: 80
Numeric: 3

Alerts

Q120 has constant value "" [Constant]
Employment has 1286 (1.4%) missing values [Missing]
RemoteWork has 15374 (17.2%) missing values [Missing]
CodingActivities has 15420 (17.3%) missing values [Missing]
EdLevel has 1211 (1.4%) missing values [Missing]
LearnCode has 1521 (1.7%) missing values [Missing]
LearnCodeOnline has 19100 (21.4%) missing values [Missing]
LearnCodeCoursesCert has 52108 (58.4%) missing values [Missing]
YearsCode has 1749 (2.0%) missing values [Missing]
YearsCodePro has 23048 (25.8%) missing values [Missing]
DevType has 12312 (13.8%) missing values [Missing]
OrgSize has 24141 (27.1%) missing values [Missing]
PurchaseInfluence has 24220 (27.2%) missing values [Missing]
TechList has 28333 (31.8%) missing values [Missing]
BuyNewTool has 6175 (6.9%) missing values [Missing]
Country has 1211 (1.4%) missing values [Missing]
Currency has 23850 (26.7%) missing values [Missing]
CompTotal has 40959 (45.9%) missing values [Missing]
LanguageHaveWorkedWith has 2044 (2.3%) missing values [Missing]
LanguageWantToWorkWith has 8475 (9.5%) missing values [Missing]
DatabaseHaveWorkedWith has 15749 (17.7%) missing values [Missing]
DatabaseWantToWorkWith has 28273 (31.7%) missing values [Missing]
PlatformHaveWorkedWith has 25556 (28.7%) missing values [Missing]
PlatformWantToWorkWith has 37876 (42.5%) missing values [Missing]
WebframeHaveWorkedWith has 22246 (24.9%) missing values [Missing]
WebframeWantToWorkWith has 32443 (36.4%) missing values [Missing]
MiscTechHaveWorkedWith has 32165 (36.1%) missing values [Missing]
MiscTechWantToWorkWith has 42336 (47.5%) missing values [Missing]
ToolsTechHaveWorkedWith has 11300 (12.7%) missing values [Missing]
ToolsTechWantToWorkWith has 20869 (23.4%) missing values [Missing]
NEWCollabToolsHaveWorkedWith has 3320 (3.7%) missing values [Missing]
NEWCollabToolsWantToWorkWith has 12535 (14.1%) missing values [Missing]
OpSysPersonal use has 2627 (2.9%) missing values [Missing]
OpSysProfessional use has 10597 (11.9%) missing values [Missing]
OfficeStackAsyncHaveWorkedWith has 20094 (22.5%) missing values [Missing]
OfficeStackAsyncWantToWorkWith has 35441 (39.7%) missing values [Missing]
OfficeStackSyncHaveWorkedWith has 5745 (6.4%) missing values [Missing]
OfficeStackSyncWantToWorkWith has 19408 (21.8%) missing values [Missing]
AISearchHaveWorkedWith has 32856 (36.8%) missing values [Missing]
AISearchWantToWorkWith has 43034 (48.3%) missing values [Missing]
AIDevHaveWorkedWith has 63280 (71.0%) missing values [Missing]
AIDevWantToWorkWith has 69597 (78.0%) missing values [Missing]
NEWSOSites has 1211 (1.4%) missing values [Missing]
SOVisitFreq has 2044 (2.3%) missing values [Missing]
SOAccount has 1332 (1.5%) missing values [Missing]
SOPartFreq has 23123 (25.9%) missing values [Missing]
SOComm has 1492 (1.7%) missing values [Missing]
SOAI has 41326 (46.3%) missing values [Missing]
AISelect has 1211 (1.4%) missing values [Missing]
AISent has 27683 (31.0%) missing values [Missing]
AIAcc has 50590 (56.7%) missing values [Missing]
AIBen has 27788 (31.2%) missing values [Missing]
AIToolInterested in Using has 56401 (63.2%) missing values [Missing]
AIToolCurrently Using has 53047 (59.5%) missing values [Missing]
AIToolNot interested in Using has 68115 (76.4%) missing values [Missing]
AINextVery different has 76523 (85.8%) missing values [Missing]
AINextNeither different nor similar has 82585 (92.6%) missing values [Missing]
AINextSomewhat similar has 82946 (93.0%) missing values [Missing]
AINextVery similar has 86563 (97.1%) missing values [Missing]
AINextSomewhat different has 65881 (73.9%) missing values [Missing]
TBranch has 23416 (26.3%) missing values [Missing]
ICorPM has 45516 (51.0%) missing values [Missing]
WorkExp has 45605 (51.1%) missing values [Missing]
Knowledge_1 has 46649 (52.3%) missing values [Missing]
Knowledge_2 has 47514 (53.3%) missing values [Missing]
Knowledge_3 has 47386 (53.1%) missing values [Missing]
Knowledge_4 has 47500 (53.3%) missing values [Missing]
Knowledge_5 has 47657 (53.4%) missing values [Missing]
Knowledge_6 has 47664 (53.4%) missing values [Missing]
Knowledge_7 has 47717 (53.5%) missing values [Missing]
Knowledge_8 has 47780 (53.6%) missing values [Missing]
Frequency_1 has 47268 (53.0%) missing values [Missing]
Frequency_2 has 47259 (53.0%) missing values [Missing]
Frequency_3 has 48130 (54.0%) missing values [Missing]
TimeSearching has 46406 (52.0%) missing values [Missing]
TimeAnswering has 46555 (52.2%) missing values [Missing]
ProfessionalTech has 47401 (53.1%) missing values [Missing]
Industry has 52410 (58.8%) missing values [Missing]
SurveyLength has 2699 (3.0%) missing values [Missing]
SurveyEase has 2630 (2.9%) missing values [Missing]
ConvertedCompYearly has 41165 (46.2%) missing values [Missing]
CompTotal is highly skewed (γ1 = 219.6019126) [Skewed]
ConvertedCompYearly is highly skewed (γ1 = 94.73829622) [Skewed]
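
The γ1 values in the Skewed alerts are skewness coefficients. The sketch below computes the plain population form, γ1 = m3 / m2^(3/2), on made-up numbers. The profiler may use a bias-adjusted estimator, so this is an illustration of the quantity and is not expected to reproduce the reported figures. The function name and the data are hypothetical.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (no bias adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical data: a single extreme value yields a large positive skew,
# similar in spirit to a compensation column with a few huge entries.
symmetric = [1, 2, 3, 4, 5]
long_tail = [1] * 99 + [1000]

print(skewness(symmetric))
print(skewness(long_tail))
```

A symmetric sample gives a skewness of zero, while the long-tailed sample gives a clearly positive value. This is the pattern the alerts flag for CompTotal and ConvertedCompYearly.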

Reproduction

Analysis started: 2023-12-09 09:13:21.570761
Analysis finished: 2023-12-09 09:13:28.966407
Duration: 7.4 seconds
Software version: ydata-profiling v4.6.3
Download configuration: config.json
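
The reported duration can be checked against the two timestamps. A minimal sketch using only the standard library and the values shown above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-09 09:13:21.570761", FMT)
finished = datetime.strptime("2023-12-09 09:13:28.966407", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.1f} seconds")
```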

Variables

Q120
Text

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Characters and Unicode

Total characters: 624288
Distinct characters: 6
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
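
To illustrate how the category counts in this report can arise, the sketch below classifies each character of the constant value "I agree" by its Unicode general category using Python's standard `unicodedata` module. The helper name is hypothetical, and this is not necessarily how the profiler computes its tables.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Ll', 'Lu', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

# 'Ll' = Lowercase Letter, 'Lu' = Uppercase Letter, 'Zs' = Space Separator.
counts = category_counts("I agree")
print(counts)
```

Each row holds five lowercase letters, one uppercase letter and one space. Multiplied by the 89184 observations, this gives the 445920, 89184 and 89184 counts listed under "Most occurring categories".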

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: I agree
2nd row: I agree
3rd row: I agree
4th row: I agree
5th row: I agree
Value    Count    Frequency (%)
i        89184    50.0%
agree    89184    50.0%

Most occurring characters

Value      Count     Frequency (%)
e          178368    28.6%
I          89184     14.3%
(space)    89184     14.3%
a          89184     14.3%
g          89184     14.3%
r          89184     14.3%

Most occurring categories

Value               Count     Frequency (%)
Lowercase Letter    445920    71.4%
Uppercase Letter    89184     14.3%
Space Separator     89184     14.3%

Most frequent character per category

Lowercase Letter
Value    Count     Frequency (%)
e        178368    40.0%
a        89184     20.0%
g        89184     20.0%
r        89184     20.0%
Uppercase Letter
Value    Count    Frequency (%)
I        89184    100.0%
Space Separator
Value      Count    Frequency (%)
(space)    89184    100.0%

Most occurring scripts

Value     Count     Frequency (%)
Latin     535104    85.7%
Common    89184     14.3%

Most frequent character per script

Latin
Value    Count     Frequency (%)
e        178368    33.3%
I        89184     16.7%
a        89184     16.7%
g        89184     16.7%
r        89184     16.7%
Common
Value      Count    Frequency (%)
(space)    89184    100.0%

Most occurring blocks

Value    Count     Frequency (%)
ASCII    624288    100.0%

Most frequent character per block

ASCII
Value      Count     Frequency (%)
e          178368    28.6%
I          89184     14.3%
(space)    89184     14.3%
a          89184     14.3%
g          89184     14.3%
r          89184     14.3%
Distinct: 6
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Length

Max length: 85
Median length: 30
Mean length: 35.1661733
Min length: 13

Characters and Unicode

Total characters: 3136260
Distinct characters: 27
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: None of these
2nd row: I am a developer by profession
3rd row: I am a developer by profession
4th row: I am a developer by profession
5th row: I am a developer by profession
ValueCountFrequency (%)
i 96927
15.5%
am 83013
13.2%
a 83012
13.2%
developer 78052
12.4%
by 69098
11.0%
profession 69098
11.0%
code 18875
 
3.0%
primarily 13914
 
2.2%
as 13914
 
2.2%
but 10815
 
1.7%
Other values (16) 90498
14.4%

Most occurring characters

ValueCountFrequency (%)
538032
17.2%
e 372122
11.9%
o 288865
 
9.2%
r 208662
 
6.7%
a 207768
 
6.6%
s 190998
 
6.1%
p 170018
 
5.4%
i 128749
 
4.1%
m 123789
 
3.9%
d 107742
 
3.4%
Other values (17) 799515
25.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2480321
79.1%
Space Separator 538032
 
17.2%
Uppercase Letter 98138
 
3.1%
Other Punctuation 19769
 
0.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 372122
15.0%
o 288865
11.6%
r 208662
 
8.4%
a 207768
 
8.4%
s 190998
 
7.7%
p 170018
 
6.9%
i 128749
 
5.2%
m 123789
 
5.0%
d 107742
 
4.3%
l 98788
 
4.0%
Other values (12) 582820
23.5%
Uppercase Letter
ValueCountFrequency (%)
I 96927
98.8%
N 1211
 
1.2%
Other Punctuation
ValueCountFrequency (%)
, 10815
54.7%
/ 8954
45.3%
Space Separator
ValueCountFrequency (%)
538032
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2578459
82.2%
Common 557801
 
17.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 372122
14.4%
o 288865
11.2%
r 208662
 
8.1%
a 207768
 
8.1%
s 190998
 
7.4%
p 170018
 
6.6%
i 128749
 
5.0%
m 123789
 
4.8%
d 107742
 
4.2%
l 98788
 
3.8%
Other values (14) 680958
26.4%
Common
ValueCountFrequency (%)
538032
96.5%
, 10815
 
1.9%
/ 8954
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3136260
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
538032
17.2%
e 372122
11.9%
o 288865
 
9.2%
r 208662
 
6.7%
a 207768
 
6.6%
s 190998
 
6.1%
p 170018
 
5.4%
i 128749
 
4.1%
m 123789
 
3.9%
d 107742
 
3.4%
Other values (17) 799515
25.5%

Age
Text

Distinct: 8
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Length

Max length: 18
Median length: 15
Mean length: 15.17518837
Min length: 15

Characters and Unicode

Total characters: 1353384
Distinct characters: 22
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 18-24 years old
2nd row: 25-34 years old
3rd row: 45-54 years old
4th row: 25-34 years old
5th row: 25-34 years old
ValueCountFrequency (%)
years 88735
32.5%
old 87564
32.0%
25-34 33247
 
12.2%
35-44 20532
 
7.5%
18-24 17931
 
6.6%
45-54 8334
 
3.0%
under 4128
 
1.5%
18 4128
 
1.5%
55-64 3392
 
1.2%
65 1171
 
0.4%
Other values (6) 4138
 
1.5%

Most occurring characters

ValueCountFrequency (%)
184116
13.6%
4 112302
 
8.3%
r 96103
 
7.1%
e 94932
 
7.0%
d 92863
 
6.9%
o 90804
 
6.7%
y 89184
 
6.6%
a 89184
 
6.6%
s 89184
 
6.6%
l 88735
 
6.6%
Other values (12) 325977
24.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 736913
54.4%
Decimal Number 344342
25.4%
Space Separator 184116
 
13.6%
Dash Punctuation 83436
 
6.2%
Uppercase Letter 4577
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 96103
13.0%
e 94932
12.9%
d 92863
12.6%
o 90804
12.3%
y 89184
12.1%
a 89184
12.1%
s 89184
12.1%
l 88735
12.0%
n 4577
 
0.6%
t 898
 
0.1%
Decimal Number
ValueCountFrequency (%)
4 112302
32.6%
5 78402
22.8%
3 53779
15.6%
2 51178
14.9%
1 22059
 
6.4%
8 22059
 
6.4%
6 4563
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
U 4128
90.2%
P 449
 
9.8%
Space Separator
ValueCountFrequency (%)
184116
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83436
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 741490
54.8%
Common 611894
45.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 96103
13.0%
e 94932
12.8%
d 92863
12.5%
o 90804
12.2%
y 89184
12.0%
a 89184
12.0%
s 89184
12.0%
l 88735
12.0%
n 4577
 
0.6%
U 4128
 
0.6%
Other values (3) 1796
 
0.2%
Common
ValueCountFrequency (%)
184116
30.1%
4 112302
18.4%
- 83436
13.6%
5 78402
12.8%
3 53779
 
8.8%
2 51178
 
8.4%
1 22059
 
3.6%
8 22059
 
3.6%
6 4563
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1353384
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
184116
13.6%
4 112302
 
8.3%
r 96103
 
7.1%
e 94932
 
7.0%
d 92863
 
6.9%
o 90804
 
6.7%
y 89184
 
6.6%
a 89184
 
6.6%
s 89184
 
6.6%
l 88735
 
6.6%
Other values (12) 325977
24.1%

Employment
Text

MISSING 

Distinct: 106
Distinct (%): 0.1%
Missing: 1286
Missing (%): 1.4%
Memory size: 1.4 MiB

Length

Max length: 212
Median length: 19
Mean length: 28.17986757
Min length: 7

Characters and Unicode

Total characters: 2476954
Distinct characters: 29
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): < 0.1%

Sample

1st row: Employed, full-time
2nd row: Employed, full-time
3rd row: Employed, full-time
4th row: Employed, full-time;Independent contractor, freelancer, or self-employed
5th row: Employed, full-time
ValueCountFrequency (%)
employed 68821
26.9%
full-time 63463
24.8%
contractor 13988
 
5.5%
freelancer 13988
 
5.5%
or 13988
 
5.5%
self-employed 11430
 
4.5%
student 9968
 
3.9%
independent 9137
 
3.6%
part-time 7424
 
2.9%
not 6316
 
2.5%
Other values (29) 37779
14.7%

Most occurring characters

ValueCountFrequency (%)
e 301752
 
12.2%
l 264342
 
10.7%
t 174736
 
7.1%
168404
 
6.8%
m 166560
 
6.7%
o 157706
 
6.4%
d 130725
 
5.3%
, 114620
 
4.6%
p 108430
 
4.4%
f 106750
 
4.3%
Other values (19) 782929
31.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1982918
80.1%
Space Separator 168404
 
6.8%
Other Punctuation 128640
 
5.2%
Uppercase Letter 101918
 
4.1%
Dash Punctuation 95074
 
3.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 301752
15.2%
l 264342
13.3%
t 174736
8.8%
m 166560
8.4%
o 157706
8.0%
d 130725
 
6.6%
p 108430
 
5.5%
f 106750
 
5.4%
n 93965
 
4.7%
u 92003
 
4.6%
Other values (10) 385949
19.5%
Uppercase Letter
ValueCountFrequency (%)
E 65928
64.7%
S 15158
 
14.9%
I 14537
 
14.3%
N 5558
 
5.5%
R 737
 
0.7%
Other Punctuation
ValueCountFrequency (%)
, 114620
89.1%
; 14020
 
10.9%
Space Separator
ValueCountFrequency (%)
168404
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 95074
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2084836
84.2%
Common 392118
 
15.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 301752
14.5%
l 264342
12.7%
t 174736
 
8.4%
m 166560
 
8.0%
o 157706
 
7.6%
d 130725
 
6.3%
p 108430
 
5.2%
f 106750
 
5.1%
n 93965
 
4.5%
u 92003
 
4.4%
Other values (15) 487867
23.4%
Common
ValueCountFrequency (%)
168404
42.9%
, 114620
29.2%
- 95074
24.2%
; 14020
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2476954
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 301752
 
12.2%
l 264342
 
10.7%
t 174736
 
7.1%
168404
 
6.8%
m 166560
 
6.7%
o 157706
 
6.4%
d 130725
 
5.3%
, 114620
 
4.6%
p 108430
 
4.4%
f 106750
 
4.3%
Other values (19) 782929
31.6%
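
The Employment samples show multi-select answers packed into a single string and separated by semicolons, which is why the word-level table above breaks options such as "Employed, full-time" into separate tokens. The sketch below counts whole options instead. It assumes `;` is the only delimiter, which is inferred from the sample rows and not confirmed by the report.

```python
from collections import Counter

# Sample values copied from the Employment section above.
rows = [
    "Employed, full-time",
    "Employed, full-time",
    "Employed, full-time",
    "Employed, full-time;Independent contractor, freelancer, or self-employed",
    "Employed, full-time",
]

# Split each answer on ';' and tally every selected option once per row.
option_counts = Counter(
    option for row in rows for option in row.split(";")
)
print(option_counts.most_common())
```

Counting at the option level keeps commas and hyphens inside an option intact, which a whitespace-based tokenizer does not.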

RemoteWork
Text

MISSING 

Distinct: 3
Distinct (%): < 0.1%
Missing: 15374
Missing (%): 17.2%
Memory size: 1.4 MiB

Length

Max length: 36
Median length: 9
Mean length: 19.14549519
Min length: 6

Characters and Unicode

Total characters: 1413129
Distinct characters: 20
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Remote
2nd row: Hybrid (some remote, some in-person)
3rd row: Hybrid (some remote, some in-person)
4th row: Remote
5th row: Remote
ValueCountFrequency (%)
some 62262
31.4%
remote 61697
31.1%
in-person 43244
21.8%
hybrid 31131
15.7%

Most occurring characters

ValueCountFrequency (%)
e 228900
16.2%
o 167203
11.8%
124524
8.8%
m 123959
8.8%
r 105506
 
7.5%
s 105506
 
7.5%
n 86488
 
6.1%
i 62262
 
4.4%
t 61697
 
4.4%
p 43244
 
3.1%
Other values (10) 303840
21.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1078158
76.3%
Space Separator 124524
 
8.8%
Uppercase Letter 73810
 
5.2%
Dash Punctuation 43244
 
3.1%
Other Punctuation 31131
 
2.2%
Close Punctuation 31131
 
2.2%
Open Punctuation 31131
 
2.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 228900
21.2%
o 167203
15.5%
m 123959
11.5%
r 105506
9.8%
s 105506
9.8%
n 86488
 
8.0%
i 62262
 
5.8%
t 61697
 
5.7%
p 43244
 
4.0%
y 31131
 
2.9%
Other values (2) 62262
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
H 31131
42.2%
R 30566
41.4%
I 12113
 
16.4%
Space Separator
ValueCountFrequency (%)
124524
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43244
100.0%
Other Punctuation
ValueCountFrequency (%)
, 31131
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31131
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1151968
81.5%
Common 261161
 
18.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 228900
19.9%
o 167203
14.5%
m 123959
10.8%
r 105506
9.2%
s 105506
9.2%
n 86488
 
7.5%
i 62262
 
5.4%
t 61697
 
5.4%
p 43244
 
3.8%
H 31131
 
2.7%
Other values (5) 136072
11.8%
Common
ValueCountFrequency (%)
124524
47.7%
- 43244
 
16.6%
, 31131
 
11.9%
) 31131
 
11.9%
( 31131
 
11.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1413129
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 228900
16.2%
o 167203
11.8%
124524
8.8%
m 123959
8.8%
r 105506
 
7.5%
s 105506
 
7.5%
n 86488
 
6.1%
i 62262
 
4.4%
t 61697
 
4.4%
p 43244
 
3.1%
Other values (10) 303840
21.5%

CodingActivities
Text

MISSING 

Distinct: 116
Distinct (%): 0.2%
Missing: 15420
Missing (%): 17.3%
Memory size: 1.4 MiB

Length

Max length: 205
Median length: 180
Mean length: 51.51102164
Min length: 5

Characters and Unicode

Total characters: 3799659
Distinct characters: 39
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8
Unique (%): < 0.1%

Sample

1st row: Hobby;Contribute to open-source projects;Bootstrapping a business;Professional development or self-paced learning from online courses
2nd row: Hobby;Professional development or self-paced learning from online courses
3rd row: Hobby
4th row: Hobby;Contribute to open-source projects;Professional development or self-paced learning from online courses
5th row: Hobby;Professional development or self-paced learning from online courses
ValueCountFrequency (%)
or 35593
 
8.4%
work 29359
 
6.9%
self-paced 26957
 
6.4%
learning 26957
 
6.4%
online 26957
 
6.4%
development 26957
 
6.4%
from 26957
 
6.4%
open-source 18231
 
4.3%
to 18231
 
4.3%
courses 18038
 
4.3%
Other values (38) 170080
40.1%

Most occurring characters

ValueCountFrequency (%)
o 459487
 
12.1%
e 399586
 
10.5%
350553
 
9.2%
r 269808
 
7.1%
n 256115
 
6.7%
s 223592
 
5.9%
c 169049
 
4.4%
t 167783
 
4.4%
l 158861
 
4.2%
a 148427
 
3.9%
Other values (29) 1196398
31.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3170453
83.4%
Space Separator 350553
 
9.2%
Uppercase Letter 140308
 
3.7%
Other Punctuation 81984
 
2.2%
Dash Punctuation 45188
 
1.2%
Final Punctuation 8809
 
0.2%
Open Punctuation 1182
 
< 0.1%
Close Punctuation 1182
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 459487
14.5%
e 399586
12.6%
r 269808
 
8.5%
n 256115
 
8.1%
s 223592
 
7.1%
c 169049
 
5.3%
t 167783
 
5.3%
l 158861
 
5.0%
a 148427
 
4.7%
i 138315
 
4.4%
Other values (13) 779430
24.6%
Uppercase Letter
ValueCountFrequency (%)
H 51942
37.0%
P 26957
19.2%
C 18231
 
13.0%
F 14258
 
10.2%
B 10293
 
7.3%
I 8809
 
6.3%
S 8636
 
6.2%
O 1182
 
0.8%
Other Punctuation
ValueCountFrequency (%)
; 66544
81.2%
/ 14258
 
17.4%
: 1182
 
1.4%
Space Separator
ValueCountFrequency (%)
350553
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45188
100.0%
Final Punctuation
ValueCountFrequency (%)
8809
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1182
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1182
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3310761
87.1%
Common 488898
 
12.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 459487
13.9%
e 399586
12.1%
r 269808
 
8.1%
n 256115
 
7.7%
s 223592
 
6.8%
c 169049
 
5.1%
t 167783
 
5.1%
l 158861
 
4.8%
a 148427
 
4.5%
i 138315
 
4.2%
Other values (21) 919738
27.8%
Common
ValueCountFrequency (%)
350553
71.7%
; 66544
 
13.6%
- 45188
 
9.2%
/ 14258
 
2.9%
8809
 
1.8%
( 1182
 
0.2%
) 1182
 
0.2%
: 1182
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3790850
99.8%
Punctuation 8809
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 459487
 
12.1%
e 399586
 
10.5%
350553
 
9.2%
r 269808
 
7.1%
n 256115
 
6.8%
s 223592
 
5.9%
c 169049
 
4.5%
t 167783
 
4.4%
l 158861
 
4.2%
a 148427
 
3.9%
Other values (28) 1187589
31.3%
Punctuation
ValueCountFrequency (%)
8809
100.0%

EdLevel
Text

MISSING 

Distinct: 8
Distinct (%): < 0.1%
Missing: 1211
Missing (%): 1.4%
Memory size: 1.4 MiB

Length

Max length: 82
Median length: 54
Mean length: 48.76636013
Min length: 14

Characters and Unicode

Total characters: 4290123
Distinct characters: 36
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Bachelor’s degree (B.A., B.S., B.Eng., etc.)
2nd row: Bachelor’s degree (B.A., B.S., B.Eng., etc.)
3rd row: Bachelor’s degree (B.A., B.S., B.Eng., etc.)
4th row: Bachelor’s degree (B.A., B.S., B.Eng., etc.)
5th row: Some college/university study without earning a degree
ValueCountFrequency (%)
degree 75696
 
12.8%
etc 72840
 
12.3%
bachelor’s 36706
 
6.2%
b.a 36706
 
6.2%
b.s 36706
 
6.2%
b.eng 36706
 
6.2%
master’s 20543
 
3.5%
m.a 20543
 
3.5%
m.s 20543
 
3.5%
m.eng 20543
 
3.5%
Other values (27) 214647
36.2%

Most occurring characters

ValueCountFrequency (%)
504206
 
11.8%
e 486158
 
11.3%
. 453130
 
10.6%
, 231246
 
5.4%
r 201641
 
4.7%
g 175720
 
4.1%
c 170496
 
4.0%
B 167367
 
3.9%
s 153654
 
3.6%
t 146582
 
3.4%
Other values (26) 1599923
37.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2313946
53.9%
Other Punctuation 698034
 
16.3%
Uppercase Letter 571008
 
13.3%
Space Separator 504206
 
11.8%
Open Punctuation 72840
 
1.7%
Close Punctuation 72840
 
1.7%
Final Punctuation 57249
 
1.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 486158
21.0%
r 201641
8.7%
g 175720
 
7.6%
c 170496
 
7.4%
s 153654
 
6.6%
t 146582
 
6.3%
o 141213
 
6.1%
a 135744
 
5.9%
n 135363
 
5.8%
l 104972
 
4.5%
Other values (9) 462403
20.0%
Uppercase Letter
ValueCountFrequency (%)
B 167367
29.3%
M 106602
18.7%
A 97917
17.1%
S 82181
14.4%
E 61136
 
10.7%
G 17794
 
3.1%
D 15548
 
2.7%
P 9679
 
1.7%
R 8897
 
1.6%
J 3887
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 453130
64.9%
, 231246
33.1%
/ 13658
 
2.0%
Space Separator
ValueCountFrequency (%)
504206
100.0%
Open Punctuation
ValueCountFrequency (%)
( 72840
100.0%
Close Punctuation
ValueCountFrequency (%)
) 72840
100.0%
Final Punctuation
ValueCountFrequency (%)
57249
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2884954
67.2%
Common 1405169
32.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 486158
16.9%
r 201641
 
7.0%
g 175720
 
6.1%
c 170496
 
5.9%
B 167367
 
5.8%
s 153654
 
5.3%
t 146582
 
5.1%
o 141213
 
4.9%
a 135744
 
4.7%
n 135363
 
4.7%
Other values (19) 971016
33.7%
Common
ValueCountFrequency (%)
504206
35.9%
. 453130
32.2%
, 231246
16.5%
( 72840
 
5.2%
) 72840
 
5.2%
57249
 
4.1%
/ 13658
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4232874
98.7%
Punctuation 57249
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
504206
 
11.9%
e 486158
 
11.5%
. 453130
 
10.7%
, 231246
 
5.5%
r 201641
 
4.8%
g 175720
 
4.2%
c 170496
 
4.0%
B 167367
 
4.0%
s 153654
 
3.6%
t 146582
 
3.5%
Other values (25) 1542674
36.4%
Punctuation
ValueCountFrequency (%)
57249
100.0%

LearnCode
Text

MISSING 

Distinct: 790
Distinct (%): 0.9%
Missing: 1521
Missing (%): 1.7%
Memory size: 1.4 MiB

Length

Max length: 274
Median length: 218
Mean length: 108.4716129
Min length: 9

Characters and Unicode

Total characters: 9508947
Distinct characters: 39
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 112
Unique (%): 0.1%

Sample

1st row: Books / Physical media;Colleague;Friend or family member;Hackathons (virtual or in-person);Online Courses or Certification;On the job training;Other online resources (e.g., videos, blogs, forum);School (i.e., University, College, etc)
2nd row: Books / Physical media;Colleague;On the job training;Other online resources (e.g., videos, blogs, forum);School (i.e., University, College, etc)
3rd row: Colleague;Friend or family member;Other online resources (e.g., videos, blogs, forum);School (i.e., University, College, etc)
4th row: Books / Physical media;Online Courses or Certification;Other online resources (e.g., videos, blogs, forum);School (i.e., University, College, etc)
5th row: Books / Physical media;Colleague;Online Courses or Certification;Other online resources (e.g., videos, blogs, forum)
ValueCountFrequency (%)
online 83053
 
7.3%
resources 70244
 
6.1%
e.g 70244
 
6.1%
videos 70244
 
6.1%
blogs 70244
 
6.1%
or 60170
 
5.3%
books 45406
 
4.0%
45406
 
4.0%
physical 45406
 
4.0%
i.e 43957
 
3.8%
Other values (70) 537953
47.1%

Most occurring characters

ValueCountFrequency (%)
1054664
 
11.1%
e 956329
 
10.1%
o 821250
 
8.6%
i 664726
 
7.0%
r 551274
 
5.8%
s 527115
 
5.5%
n 474825
 
5.0%
l 424432
 
4.5%
t 353439
 
3.7%
, 342603
 
3.6%
Other values (29) 3338290
35.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6841891
72.0%
Space Separator 1054664
 
11.1%
Other Punctuation 828932
 
8.7%
Uppercase Letter 523057
 
5.5%
Open Punctuation 126685
 
1.3%
Close Punctuation 126685
 
1.3%
Dash Punctuation 7033
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 956329
14.0%
o 821250
12.0%
i 664726
9.7%
r 551274
 
8.1%
s 527115
 
7.7%
n 474825
 
6.9%
l 424432
 
6.2%
t 353439
 
5.2%
c 267851
 
3.9%
g 253950
 
3.7%
Other values (12) 1546700
22.6%
Uppercase Letter
ValueCountFrequency (%)
C 159484
30.5%
O 159276
30.5%
B 54008
 
10.3%
P 45406
 
8.7%
S 43957
 
8.4%
U 43957
 
8.4%
F 9936
 
1.9%
H 7033
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 342603
41.3%
. 228402
27.6%
; 207070
25.0%
/ 45406
 
5.5%
: 5451
 
0.7%
Space Separator
ValueCountFrequency (%)
1054664
100.0%
Open Punctuation
ValueCountFrequency (%)
( 126685
100.0%
Close Punctuation
ValueCountFrequency (%)
) 126685
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7033
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7364948
77.5%
Common 2143999
 
22.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 956329
13.0%
o 821250
 
11.2%
i 664726
 
9.0%
r 551274
 
7.5%
s 527115
 
7.2%
n 474825
 
6.4%
l 424432
 
5.8%
t 353439
 
4.8%
c 267851
 
3.6%
g 253950
 
3.4%
Other values (20) 2069757
28.1%
Common
ValueCountFrequency (%)
1054664
49.2%
, 342603
 
16.0%
. 228402
 
10.7%
; 207070
 
9.7%
( 126685
 
5.9%
) 126685
 
5.9%
/ 45406
 
2.1%
- 7033
 
0.3%
: 5451
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9508947
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1054664
 
11.1%
e 956329
 
10.1%
o 821250
 
8.6%
i 664726
 
7.0%
r 551274
 
5.8%
s 527115
 
5.5%
n 474825
 
5.0%
l 424432
 
4.5%
t 353439
 
3.7%
, 342603
 
3.6%
Other values (29) 3338290
35.1%

LearnCodeOnline
Text

MISSING 

Distinct: 7940
Distinct (%): 11.3%
Missing: 19100
Missing (%): 21.4%
Memory size: 1.4 MiB

Length

Max length: 419
Median length: 330
Mean length: 171.2553507
Min length: 5

Characters and Unicode

Total characters: 12002260
Distinct characters: 46
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3649
Unique (%): 5.2%

Sample

1st row: Formal documentation provided by the owner of the tech;Blogs with tips and tricks;Books;Recorded coding sessions;How-to videos;Video-based Online Courses;Written-based Online Courses;Auditory material (e.g., podcasts);Online challenges (e.g., daily or weekly coding challenges);Written Tutorials;Click to write Choice 20;Stack Overflow
2nd row: Formal documentation provided by the owner of the tech;Blogs with tips and tricks;How-to videos;Online challenges (e.g., daily or weekly coding challenges);Written Tutorials;Click to write Choice 20;Stack Overflow
3rd row: Formal documentation provided by the owner of the tech;Blogs with tips and tricks;Auditory material (e.g., podcasts);Written Tutorials;Stack Overflow;Interactive tutorial
4th row: Formal documentation provided by the owner of the tech;Blogs with tips and tricks;Books;How-to videos;Video-based Online Courses;Online challenges (e.g., daily or weekly coding challenges);Written Tutorials;Click to write Choice 20;Interactive tutorial;Certification videos
5th row: Formal documentation provided by the owner of the tech;Blogs with tips and tricks;Books;Recorded coding sessions;How-to videos;Written Tutorials;Stack Overflow;Interactive tutorial
ValueCountFrequency (%)
the 126658
 
9.2%
formal 63329
 
4.6%
provided 63329
 
4.6%
by 63329
 
4.6%
owner 63329
 
4.6%
of 63329
 
4.6%
documentation 63329
 
4.6%
online 59993
 
4.4%
tips 53745
 
3.9%
and 53745
 
3.9%
Other values (170) 699759
50.9%

Most occurring characters

ValueCountFrequency (%)
1303790
 
10.9%
e 1084259
 
9.0%
o 1017355
 
8.5%
t 924717
 
7.7%
i 800635
 
6.7%
r 603372
 
5.0%
s 594649
 
5.0%
n 579728
 
4.8%
d 491074
 
4.1%
a 459911
 
3.8%
Other values (36) 4142770
34.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9330492
77.7%
Space Separator 1303790
 
10.9%
Uppercase Letter 709970
 
5.9%
Other Punctuation 449571
 
3.7%
Dash Punctuation 102087
 
0.9%
Decimal Number 59560
 
0.5%
Open Punctuation 23395
 
0.2%
Close Punctuation 23395
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1084259
11.6%
o 1017355
10.9%
t 924717
 
9.9%
i 800635
 
8.6%
r 603372
 
6.5%
s 594649
 
6.4%
n 579728
 
6.2%
d 491074
 
5.3%
a 459911
 
4.9%
l 441566
 
4.7%
Other values (12) 2333226
25.0%
Uppercase Letter
ValueCountFrequency (%)
O 135623
19.1%
C 128824
18.1%
B 84177
11.9%
W 67321
9.5%
F 63329
8.9%
S 57861
8.1%
H 42149
 
5.9%
T 42012
 
5.9%
V 34629
 
4.9%
R 19690
 
2.8%
Other values (4) 34355
 
4.8%
Other Punctuation
ValueCountFrequency (%)
; 383946
85.4%
. 42230
 
9.4%
, 21115
 
4.7%
: 2280
 
0.5%
Decimal Number
ValueCountFrequency (%)
0 29780
50.0%
2 29780
50.0%
Space Separator
ValueCountFrequency (%)
1303790
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 102087
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23395
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23395
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 10040462
83.7%
Common 1961798
 
16.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1084259
 
10.8%
o 1017355
 
10.1%
t 924717
 
9.2%
i 800635
 
8.0%
r 603372
 
6.0%
s 594649
 
5.9%
n 579728
 
5.8%
d 491074
 
4.9%
a 459911
 
4.6%
l 441566
 
4.4%
Other values (26) 3043196
30.3%
Common
ValueCountFrequency (%)
1303790
66.5%
; 383946
 
19.6%
- 102087
 
5.2%
. 42230
 
2.2%
0 29780
 
1.5%
2 29780
 
1.5%
( 23395
 
1.2%
) 23395
 
1.2%
, 21115
 
1.1%
: 2280
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12002260
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1303790
 
10.9%
e 1084259
 
9.0%
o 1017355
 
8.5%
t 924717
 
7.7%
i 800635
 
6.7%
r 603372
 
5.0%
s 594649
 
5.0%
n 579728
 
4.8%
d 491074
 
4.1%
a 459911
 
3.8%
Other values (36) 4142770
34.5%

LearnCodeCoursesCert
Text

MISSING 

Distinct210
Distinct (%)0.6%
Missing52108
Missing (%)58.4%
Memory size1.4 MiB

Length

Max length65
Median length53
Mean length14.38750135
Min length3

Characters and Unicode

Total characters533431
Distinct characters24
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
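
The category counts in this report can be approximated with Python's standard `unicodedata` module. The sketch below is illustrative only: `category_counts` is a hypothetical helper, not part of the profiling tool, and the input is the five sample rows shown for this variable rather than the full column.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in the given strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            # unicodedata.category returns two-letter codes such as
            # "Lu" (Uppercase Letter), "Ll" (Lowercase Letter), "Po" (Other Punctuation).
            counts[unicodedata.category(ch)] += 1
    return counts


# Sample rows copied from the LearnCodeCoursesCert section (assumed representative).
sample = ["Other", "Other;Codecademy;edX", "Other", "Udemy", "Codecademy;edX"]
counts = category_counts(sample)
```

On these rows only three categories appear (uppercase letters, lowercase letters, and the `;` separator), which is consistent with the "Distinct categories" figure reported for this variable.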

Unique

Unique30 ?
Unique (%)0.1%

Sample

1st rowOther
2nd rowOther;Codecademy;edX
3rd rowOther
4th rowUdemy
5th rowCodecademy;edX
ValueCountFrequency (%)
udemy 7445
20.1%
other 3230
 
8.7%
udemy;coursera 2612
 
7.0%
udemy;pluralsight 1958
 
5.3%
codecademy;udemy 1837
 
5.0%
coursera 1634
 
4.4%
pluralsight 1619
 
4.4%
other;udemy 1370
 
3.7%
codecademy 1248
 
3.4%
codecademy;udemy;coursera 841
 
2.3%
Other values (200) 13282
35.8%

Most occurring characters

ValueCountFrequency (%)
e 68509
12.8%
d 51854
 
9.7%
r 41946
 
7.9%
y 37303
 
7.0%
; 35627
 
6.7%
a 34306
 
6.4%
m 33311
 
6.2%
U 28288
 
5.3%
o 22605
 
4.2%
s 22053
 
4.1%
Other values (14) 157629
29.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 425101
79.7%
Uppercase Letter 72703
 
13.6%
Other Punctuation 35627
 
6.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 68509
16.1%
d 51854
12.2%
r 41946
9.9%
y 37303
8.8%
a 34306
8.1%
m 33311
7.8%
o 22605
 
5.3%
s 22053
 
5.2%
u 21299
 
5.0%
t 21020
 
4.9%
Other values (7) 70895
16.7%
Uppercase Letter
ValueCountFrequency (%)
U 28288
38.9%
C 21851
30.1%
P 8463
 
11.6%
O 7811
 
10.7%
X 5536
 
7.6%
S 754
 
1.0%
Other Punctuation
ValueCountFrequency (%)
; 35627
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 497804
93.3%
Common 35627
 
6.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 68509
13.8%
d 51854
 
10.4%
r 41946
 
8.4%
y 37303
 
7.5%
a 34306
 
6.9%
m 33311
 
6.7%
U 28288
 
5.7%
o 22605
 
4.5%
s 22053
 
4.4%
C 21851
 
4.4%
Other values (13) 135778
27.3%
Common
ValueCountFrequency (%)
; 35627
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 533431
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 68509
12.8%
d 51854
 
9.7%
r 41946
 
7.9%
y 37303
 
7.0%
; 35627
 
6.7%
a 34306
 
6.4%
m 33311
 
6.2%
U 28288
 
5.3%
o 22605
 
4.2%
s 22053
 
4.1%
Other values (14) 157629
29.6%
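
The value table for LearnCodeCoursesCert counts whole answer strings, so a platform such as Udemy is spread across many rows ("udemy", "udemy;coursera", and so on). A minimal sketch of counting each choice individually follows, assuming responses are `;`-separated strings and missing responses are represented as `None`. The helper name `count_choices` is hypothetical.

```python
from collections import Counter


def count_choices(values, sep=";"):
    """Count individual choices in multi-select answers joined by `sep`."""
    counts = Counter()
    for value in values:
        if value is None:  # assumed representation of a missing response
            continue
        # Skip empty fragments that a trailing or doubled separator would produce.
        counts.update(part.strip() for part in value.split(sep) if part.strip())
    return counts


# Sample rows from the report plus one missing response.
rows = ["Other", "Other;Codecademy;edX", "Other", "Udemy", "Codecademy;edX", None]
per_platform = count_choices(rows)
```

The same approach would apply to the other `;`-delimited columns in this dataset, such as BuyNewTool.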

YearsCode
Text

MISSING 

Distinct52
Distinct (%)0.1%
Missing1749
Missing (%)2.0%
Memory size1.4 MiB

Length

Max length18
Median length2
Mean length1.808177503
Min length1

Characters and Unicode

Total characters158098
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row18
2nd row27
3rd row12
4th row6
5th row21
ValueCountFrequency (%)
10 6521
 
7.1%
5 5415
 
5.9%
6 4893
 
5.4%
8 4879
 
5.3%
7 4800
 
5.3%
4 4466
 
4.9%
15 4336
 
4.7%
3 4269
 
4.7%
20 4021
 
4.4%
12 3471
 
3.8%
Other values (45) 44327
48.5%

Most occurring characters

ValueCountFrequency (%)
1 33669
21.3%
2 21716
13.7%
3 14950
9.5%
0 14400
9.1%
5 14392
9.1%
4 11175
 
7.1%
6 7824
 
4.9%
8 7808
 
4.9%
7 7445
 
4.7%
9 4551
 
2.9%
Other values (12) 20168
12.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 137930
87.2%
Lowercase Letter 14884
 
9.4%
Space Separator 3963
 
2.5%
Uppercase Letter 1321
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 33669
24.4%
2 21716
15.7%
3 14950
10.8%
0 14400
10.4%
5 14392
10.4%
4 11175
 
8.1%
6 7824
 
5.7%
8 7808
 
5.7%
7 7445
 
5.4%
9 4551
 
3.3%
Lowercase Letter
ValueCountFrequency (%)
a 2642
17.8%
e 2642
17.8%
s 2289
15.4%
r 1674
11.2%
t 1321
8.9%
h 1321
8.9%
n 1321
8.9%
y 1321
8.9%
o 353
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
L 968
73.3%
M 353
 
26.7%
Space Separator
ValueCountFrequency (%)
3963
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 141893
89.8%
Latin 16205
 
10.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 33669
23.7%
2 21716
15.3%
3 14950
10.5%
0 14400
10.1%
5 14392
10.1%
4 11175
 
7.9%
6 7824
 
5.5%
8 7808
 
5.5%
7 7445
 
5.2%
9 4551
 
3.2%
Latin
ValueCountFrequency (%)
a 2642
16.3%
e 2642
16.3%
s 2289
14.1%
r 1674
10.3%
t 1321
8.2%
h 1321
8.2%
n 1321
8.2%
y 1321
8.2%
L 968
 
6.0%
M 353
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 158098
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 33669
21.3%
2 21716
13.7%
3 14950
9.5%
0 14400
9.1%
5 14392
9.1%
4 11175
 
7.1%
6 7824
 
4.9%
8 7808
 
4.9%
7 7445
 
4.7%
9 4551
 
2.9%
Other values (12) 20168
12.8%
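
YearsCode is typed as Text even though most values are integers, because a small number of rows contain letters. A hedged sketch of converting the column to numbers follows. The two text labels are an assumption inferred from the letter counts and the maximum length of 18 above, and the numeric stand-ins (0.5 and 51) are arbitrary illustrative choices. `parse_years` is a hypothetical helper.

```python
# Assumed text labels (inferred, not confirmed by the report) and illustrative stand-ins.
SPECIAL_LABELS = {
    "Less than 1 year": 0.5,
    "More than 50 years": 51.0,
}


def parse_years(value):
    """Convert a YearsCode-style answer to a float, or None if it cannot be parsed."""
    if value is None:
        return None
    text = value.strip()
    if text in SPECIAL_LABELS:
        return SPECIAL_LABELS[text]
    try:
        return float(text)
    except ValueError:
        return None  # unknown label: treat as missing rather than guessing


parsed = [parse_years(v) for v in ["18", "27", "Less than 1 year", None]]
```

The same conversion would likely apply to YearsCodePro, which shows the same set of distinct characters.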

YearsCodePro
Text

MISSING 

Distinct52
Distinct (%)0.1%
Missing23048
Missing (%)25.8%
Memory size1.4 MiB

Length

Max length18
Median length16
Mean length1.91047236
Min length1

Characters and Unicode

Total characters126351
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row9
2nd row23
3rd row7
4th row4
5th row21
ValueCountFrequency (%)
5 4792
 
6.7%
10 4594
 
6.4%
2 4464
 
6.2%
1 4432
 
6.2%
3 4378
 
6.1%
4 3970
 
5.5%
6 3637
 
5.1%
7 3509
 
4.9%
8 3462
 
4.8%
15 2789
 
3.9%
Other values (45) 31866
44.3%

Most occurring characters

ValueCountFrequency (%)
1 26301
20.8%
2 16575
13.1%
3 10431
 
8.3%
5 9701
 
7.7%
0 8181
 
6.5%
4 6932
 
5.5%
5757
 
4.6%
6 5634
 
4.5%
7 5319
 
4.2%
8 5285
 
4.2%
Other values (12) 26235
20.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 97483
77.2%
Lowercase Letter 21192
 
16.8%
Space Separator 5757
 
4.6%
Uppercase Letter 1919
 
1.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 26301
27.0%
2 16575
17.0%
3 10431
 
10.7%
5 9701
 
10.0%
0 8181
 
8.4%
4 6932
 
7.1%
6 5634
 
5.8%
7 5319
 
5.5%
8 5285
 
5.4%
9 3124
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
a 3838
18.1%
e 3838
18.1%
s 3755
17.7%
r 2002
9.4%
t 1919
9.1%
h 1919
9.1%
n 1919
9.1%
y 1919
9.1%
o 83
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
L 1836
95.7%
M 83
 
4.3%
Space Separator
ValueCountFrequency (%)
5757
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 103240
81.7%
Latin 23111
 
18.3%

Most frequent character per script

Common
ValueCountFrequency (%)
1 26301
25.5%
2 16575
16.1%
3 10431
 
10.1%
5 9701
 
9.4%
0 8181
 
7.9%
4 6932
 
6.7%
5757
 
5.6%
6 5634
 
5.5%
7 5319
 
5.2%
8 5285
 
5.1%
Latin
ValueCountFrequency (%)
a 3838
16.6%
e 3838
16.6%
s 3755
16.2%
r 2002
8.7%
t 1919
8.3%
h 1919
8.3%
n 1919
8.3%
y 1919
8.3%
L 1836
7.9%
M 83
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 126351
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 26301
20.8%
2 16575
13.1%
3 10431
 
8.3%
5 9701
 
7.7%
0 8181
 
6.5%
4 6932
 
5.5%
5757
 
4.6%
6 5634
 
4.5%
7 5319
 
4.2%
8 5285
 
4.2%
Other values (12) 26235
20.8%

DevType
Text

MISSING 

Distinct33
Distinct (%)< 0.1%
Missing12312
Missing (%)13.8%
Memory size1.4 MiB

Length

Max length45
Median length43
Mean length22.41416901
Min length7

Characters and Unicode

Total characters1723022
Distinct characters45
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSenior Executive (C-Suite, VP, etc.)
2nd rowDeveloper, back-end
3rd rowDeveloper, front-end
4th rowDeveloper, full-stack
5th rowDeveloper, back-end
ValueCountFrequency (%)
developer 54887
28.9%
full-stack 25735
13.5%
back-end 13745
 
7.2%
or 9775
 
5.1%
applications 5749
 
3.0%
front-end 5071
 
2.7%
desktop 3904
 
2.1%
enterprise 3904
 
2.1%
data 3673
 
1.9%
specify 3080
 
1.6%
Other values (50) 60575
31.9%

Most occurring characters

ValueCountFrequency (%)
e 278267
16.1%
l 128874
 
7.5%
113226
 
6.6%
r 108683
 
6.3%
o 91288
 
5.3%
p 87883
 
5.1%
a 87835
 
5.1%
t 77530
 
4.5%
s 70956
 
4.1%
c 69745
 
4.0%
Other values (35) 608735
35.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1402368
81.4%
Space Separator 113226
 
6.6%
Uppercase Letter 88268
 
5.1%
Other Punctuation 64453
 
3.7%
Dash Punctuation 45883
 
2.7%
Close Punctuation 4412
 
0.3%
Open Punctuation 4412
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 278267
19.8%
l 128874
 
9.2%
r 108683
 
7.7%
o 91288
 
6.5%
p 87883
 
6.3%
a 87835
 
6.3%
t 77530
 
5.5%
s 70956
 
5.1%
c 69745
 
5.0%
n 65497
 
4.7%
Other values (14) 335810
23.9%
Uppercase Letter
ValueCountFrequency (%)
D 60590
68.6%
S 6228
 
7.1%
E 6067
 
6.9%
O 4467
 
5.1%
C 2368
 
2.7%
P 2367
 
2.7%
A 2152
 
2.4%
R 1353
 
1.5%
V 1332
 
1.5%
Q 586
 
0.7%
Other values (3) 758
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 58688
91.1%
: 3080
 
4.8%
& 1353
 
2.1%
. 1332
 
2.1%
Space Separator
ValueCountFrequency (%)
113226
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45883
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4412
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4412
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1490636
86.5%
Common 232386
 
13.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 278267
18.7%
l 128874
 
8.6%
r 108683
 
7.3%
o 91288
 
6.1%
p 87883
 
5.9%
a 87835
 
5.9%
t 77530
 
5.2%
s 70956
 
4.8%
c 69745
 
4.7%
n 65497
 
4.4%
Other values (27) 424078
28.4%
Common
ValueCountFrequency (%)
113226
48.7%
, 58688
25.3%
- 45883
19.7%
) 4412
 
1.9%
( 4412
 
1.9%
: 3080
 
1.3%
& 1353
 
0.6%
. 1332
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1723022
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 278267
16.1%
l 128874
 
7.5%
113226
 
6.6%
r 108683
 
6.3%
o 91288
 
5.3%
p 87883
 
5.1%
a 87835
 
5.1%
t 77530
 
4.5%
s 70956
 
4.1%
c 69745
 
4.0%
Other values (35) 608735
35.3%

OrgSize
Text

MISSING 

Distinct10
Distinct (%)< 0.1%
Missing24141
Missing (%)27.1%
Memory size1.4 MiB

Length

Max length50
Median length24
Mean length21.91067448
Min length12

Characters and Unicode

Total characters1425136
Distinct characters31
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2 to 9 employees
2nd row5,000 to 9,999 employees
3rd row100 to 499 employees
4th row20 to 99 employees
5th row100 to 499 employees
ValueCountFrequency (%)
employees 59604
21.0%
to 51675
18.2%
20 13380
 
4.7%
99 13380
 
4.7%
100 12218
 
4.3%
499 12218
 
4.3%
10,000 7929
 
2.8%
or 7929
 
2.8%
more 7929
 
2.8%
1,000 7235
 
2.5%
Other values (21) 90608
31.9%

Most occurring characters

ValueCountFrequency (%)
219062
15.4%
e 216113
15.2%
o 142211
10.0%
0 113466
8.0%
9 108718
 
7.6%
m 75925
 
5.3%
p 67996
 
4.8%
l 67996
 
4.8%
s 67996
 
4.8%
t 65506
 
4.6%
Other values (21) 280147
19.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 844164
59.2%
Decimal Number 306495
 
21.5%
Space Separator 219062
 
15.4%
Other Punctuation 40341
 
2.8%
Uppercase Letter 9635
 
0.7%
Dash Punctuation 4196
 
0.3%
Final Punctuation 1243
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 216113
25.6%
o 142211
16.8%
m 75925
 
9.0%
p 67996
 
8.1%
l 67996
 
8.1%
s 67996
 
8.1%
t 65506
 
7.8%
y 59604
 
7.1%
r 36838
 
4.4%
a 12588
 
1.5%
Other values (8) 31391
 
3.7%
Decimal Number
ValueCountFrequency (%)
0 113466
37.0%
9 108718
35.5%
1 37890
 
12.4%
2 19819
 
6.5%
4 19453
 
6.3%
5 7149
 
2.3%
Other Punctuation
ValueCountFrequency (%)
, 36145
89.6%
. 4196
 
10.4%
Uppercase Letter
ValueCountFrequency (%)
I 5439
56.5%
J 4196
43.5%
Space Separator
ValueCountFrequency (%)
219062
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4196
100.0%
Final Punctuation
ValueCountFrequency (%)
1243
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 853799
59.9%
Common 571337
40.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 216113
25.3%
o 142211
16.7%
m 75925
 
8.9%
p 67996
 
8.0%
l 67996
 
8.0%
s 67996
 
8.0%
t 65506
 
7.7%
y 59604
 
7.0%
r 36838
 
4.3%
a 12588
 
1.5%
Other values (10) 41026
 
4.8%
Common
ValueCountFrequency (%)
219062
38.3%
0 113466
19.9%
9 108718
19.0%
1 37890
 
6.6%
, 36145
 
6.3%
2 19819
 
3.5%
4 19453
 
3.4%
5 7149
 
1.3%
- 4196
 
0.7%
. 4196
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1423893
99.9%
Punctuation 1243
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
219062
15.4%
e 216113
15.2%
o 142211
10.0%
0 113466
8.0%
9 108718
 
7.6%
m 75925
 
5.3%
p 67996
 
4.8%
l 67996
 
4.8%
s 67996
 
4.8%
t 65506
 
4.6%
Other values (20) 278904
19.6%
Punctuation
ValueCountFrequency (%)
1243
100.0%
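
OrgSize holds ranges as text ("20 to 99 employees"), which sort alphabetically rather than by size. One possible way to recover numeric bounds is sketched below. It assumes the label shapes seen in the sample rows and word counts above ("N to M employees", "N or more employees"); any other label is returned as `None`. `org_size_bounds` is a hypothetical helper.

```python
import re


def org_size_bounds(label):
    """Return (lower, upper) employee counts for a range label, or None if not a range.

    An open-ended label such as "10,000 or more employees" yields (10000, None).
    """
    # Match digit runs that may contain thousands separators, e.g. "5,000".
    numbers = [int(n.replace(",", "")) for n in re.findall(r"\d[\d,]*", label)]
    if len(numbers) == 2:
        return numbers[0], numbers[1]
    if len(numbers) == 1 and "or more" in label:
        return numbers[0], None
    return None


bounds = [org_size_bounds(s) for s in ["2 to 9 employees", "5,000 to 9,999 employees"]]
```

Sorting by the lower bound then gives an ordering by organisation size, with non-range answers handled separately.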

PurchaseInfluence
Text

MISSING 

Distinct3
Distinct (%)< 0.1%
Missing24220
Missing (%)27.2%
Memory size1.4 MiB

Length

Max length32
Median length29
Mean length26.41141247
Min length21

Characters and Unicode

Total characters1715791
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI have a great deal of influence
2nd rowI have some influence
3rd rowI have some influence
4th rowI have some influence
5th rowI have little or no influence
ValueCountFrequency (%)
i 64964
18.5%
have 64964
18.5%
influence 64964
18.5%
some 26805
7.6%
little 22734
 
6.5%
or 22734
 
6.5%
no 22734
 
6.5%
a 15425
 
4.4%
great 15425
 
4.4%
deal 15425
 
4.4%

Most occurring characters

ValueCountFrequency (%)
286635
16.7%
e 275281
16.0%
n 152662
8.9%
l 125857
 
7.3%
a 111239
 
6.5%
i 87698
 
5.1%
o 87698
 
5.1%
f 80389
 
4.7%
c 64964
 
3.8%
u 64964
 
3.8%
Other values (9) 378404
22.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1364192
79.5%
Space Separator 286635
 
16.7%
Uppercase Letter 64964
 
3.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 275281
20.2%
n 152662
11.2%
l 125857
9.2%
a 111239
8.2%
i 87698
 
6.4%
o 87698
 
6.4%
f 80389
 
5.9%
c 64964
 
4.8%
u 64964
 
4.8%
v 64964
 
4.8%
Other values (7) 248476
18.2%
Space Separator
ValueCountFrequency (%)
286635
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 64964
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1429156
83.3%
Common 286635
 
16.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 275281
19.3%
n 152662
10.7%
l 125857
8.8%
a 111239
 
7.8%
i 87698
 
6.1%
o 87698
 
6.1%
f 80389
 
5.6%
c 64964
 
4.5%
u 64964
 
4.5%
I 64964
 
4.5%
Other values (8) 313440
21.9%
Common
ValueCountFrequency (%)
286635
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1715791
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
286635
16.7%
e 275281
16.0%
n 152662
8.9%
l 125857
 
7.3%
a 111239
 
6.5%
i 87698
 
5.1%
o 87698
 
5.1%
f 80389
 
4.7%
c 64964
 
3.8%
u 64964
 
3.8%
Other values (9) 378404
22.1%

TechList
Text

MISSING 

Distinct3
Distinct (%)< 0.1%
Missing28333
Missing (%)31.8%
Memory size1.4 MiB

Length

Max length12
Median length11
Mean length10.76518052
Min length5

Characters and Unicode

Total characters655072
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowInvestigate
2nd rowGiven a list
3rd rowInvestigate
4th rowInvestigate
5th rowInvestigate
ValueCountFrequency (%)
investigate 49212
64.1%
given 7935
 
10.3%
a 7935
 
10.3%
list 7935
 
10.3%
other 3704
 
4.8%

Most occurring characters

ValueCountFrequency (%)
e 110063
16.8%
t 110063
16.8%
i 65082
9.9%
n 57147
8.7%
v 57147
8.7%
s 57147
8.7%
a 57147
8.7%
I 49212
7.5%
g 49212
7.5%
15870
 
2.4%
Other values (5) 26982
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 578351
88.3%
Uppercase Letter 60851
 
9.3%
Space Separator 15870
 
2.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 110063
19.0%
t 110063
19.0%
i 65082
11.3%
n 57147
9.9%
v 57147
9.9%
s 57147
9.9%
a 57147
9.9%
g 49212
8.5%
l 7935
 
1.4%
h 3704
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
I 49212
80.9%
G 7935
 
13.0%
O 3704
 
6.1%
Space Separator
ValueCountFrequency (%)
15870
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 639202
97.6%
Common 15870
 
2.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 110063
17.2%
t 110063
17.2%
i 65082
10.2%
n 57147
8.9%
v 57147
8.9%
s 57147
8.9%
a 57147
8.9%
I 49212
7.7%
g 49212
7.7%
G 7935
 
1.2%
Other values (4) 19047
 
3.0%
Common
ValueCountFrequency (%)
15870
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 655072
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 110063
16.8%
t 110063
16.8%
i 65082
9.9%
n 57147
8.7%
v 57147
8.7%
s 57147
8.7%
a 57147
8.7%
I 49212
7.5%
g 49212
7.5%
15870
 
2.4%
Other values (5) 26982
 
4.1%

BuyNewTool
Text

MISSING 

Distinct231
Distinct (%)0.3%
Missing6175
Missing (%)6.9%
Memory size1.4 MiB

Length

Max length303
Median length260
Mean length102.563216
Min length18

Characters and Unicode

Total characters8513670
Distinct characters36
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)< 0.1%

Sample

1st rowStart a free trial;Ask developers I know/work with;Visit developer communities like Stack Overflow;Other (please specify):
2nd rowStart a free trial;Ask developers I know/work with;Visit developer communities like Stack Overflow
3rd rowStart a free trial;Ask developers I know/work with;Visit developer communities like Stack Overflow
4th rowStart a free trial;Ask developers I know/work with;Visit developer communities like Stack Overflow;Research companies that have advertised on sites I visit
5th rowStart a free trial;Ask developers I know/work with;Visit developer communities like Stack Overflow
ValueCountFrequency (%)
like 81149
 
6.6%
a 73985
 
6.0%
i 71293
 
5.8%
start 61210
 
5.0%
free 61210
 
5.0%
developers 58955
 
4.8%
know/work 58955
 
4.8%
developer 53221
 
4.3%
communities 53221
 
4.3%
stack 53221
 
4.3%
Other values (51) 605627
49.2%

Most occurring characters

ValueCountFrequency (%)
1149038
13.5%
e 938141
 
11.0%
r 623401
 
7.3%
t 616133
 
7.2%
i 615308
 
7.2%
o 475071
 
5.6%
s 443667
 
5.2%
a 436407
 
5.1%
l 330931
 
3.9%
k 324010
 
3.8%
Other values (26) 2561563
30.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6610362
77.6%
Space Separator 1149038
 
13.5%
Uppercase Letter 495968
 
5.8%
Other Punctuation 218640
 
2.6%
Decimal Number 27928
 
0.3%
Open Punctuation 5867
 
0.1%
Close Punctuation 5867
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 938141
14.2%
r 623401
9.4%
t 616133
9.3%
i 615308
9.3%
o 475071
 
7.2%
s 443667
 
6.7%
a 436407
 
6.6%
l 330931
 
5.0%
k 324010
 
4.9%
w 285942
 
4.3%
Other values (11) 1521351
23.0%
Uppercase Letter
ValueCountFrequency (%)
S 114431
23.1%
A 84505
17.0%
I 84068
17.0%
O 59088
11.9%
V 53221
10.7%
R 44799
 
9.0%
G 27928
 
5.6%
C 27928
 
5.6%
Other Punctuation
ValueCountFrequency (%)
; 153818
70.4%
/ 58955
 
27.0%
: 5867
 
2.7%
Space Separator
ValueCountFrequency (%)
1149038
100.0%
Decimal Number
ValueCountFrequency (%)
2 27928
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5867
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5867
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7106330
83.5%
Common 1407340
 
16.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 938141
13.2%
r 623401
 
8.8%
t 616133
 
8.7%
i 615308
 
8.7%
o 475071
 
6.7%
s 443667
 
6.2%
a 436407
 
6.1%
l 330931
 
4.7%
k 324010
 
4.6%
w 285942
 
4.0%
Other values (19) 2017319
28.4%
Common
ValueCountFrequency (%)
1149038
81.6%
; 153818
 
10.9%
/ 58955
 
4.2%
2 27928
 
2.0%
( 5867
 
0.4%
) 5867
 
0.4%
: 5867
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8513670
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1149038
13.5%
e 938141
 
11.0%
r 623401
 
7.3%
t 616133
 
7.2%
i 615308
 
7.2%
o 475071
 
5.6%
s 443667
 
5.2%
a 436407
 
5.1%
l 330931
 
3.9%
k 324010
 
3.8%
Other values (26) 2561563
30.1%

Country
Text

MISSING 

Distinct185
Distinct (%)0.2%
Missing1211
Missing (%)1.4%
Memory size1.4 MiB

Length

Max length52
Median length37
Mean length13.93950417
Min length4

Characters and Unicode

Total characters1226300
Distinct characters58
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)< 0.1%

Sample

1st rowUnited States of America
2nd rowUnited States of America
3rd rowUnited States of America
4th rowPhilippines
5th rowUnited Kingdom of Great Britain and Northern Ireland
ValueCountFrequency (%)
of 25127
 
13.1%
united 24413
 
12.8%
america 18647
 
9.8%
states 18647
 
9.8%
germany 7328
 
3.8%
ireland 6016
 
3.1%
and 5668
 
3.0%
india 5625
 
2.9%
kingdom 5552
 
2.9%
great 5552
 
2.9%
Other values (214) 68611
35.9%

Most occurring characters

ValueCountFrequency (%)
a 134836
 
11.0%
e 118916
 
9.7%
103213
 
8.4%
n 98990
 
8.1%
i 95166
 
7.8%
t 93258
 
7.6%
r 80021
 
6.5%
d 62882
 
5.1%
o 49354
 
4.0%
m 36443
 
3.0%
Other values (48) 353221
28.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 958556
78.2%
Uppercase Letter 160744
 
13.1%
Space Separator 103213
 
8.4%
Other Punctuation 3374
 
0.3%
Open Punctuation 205
 
< 0.1%
Close Punctuation 205
 
< 0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 134836
14.1%
e 118916
12.4%
n 98990
10.3%
i 95166
9.9%
t 93258
9.7%
r 80021
8.3%
d 62882
 
6.6%
o 49354
 
5.1%
m 36443
 
3.8%
s 31433
 
3.3%
Other values (17) 157257
16.4%
Uppercase Letter
ValueCountFrequency (%)
S 26049
16.2%
U 25543
15.9%
A 23857
14.8%
I 16180
10.1%
G 13838
8.6%
N 10492
6.5%
B 9892
 
6.2%
C 6526
 
4.1%
K 6459
 
4.0%
F 4765
 
3.0%
Other values (14) 17143
10.7%
Other Punctuation
ValueCountFrequency (%)
. 2670
79.1%
, 685
 
20.3%
' 19
 
0.6%
Space Separator
ValueCountFrequency (%)
103213
100.0%
Open Punctuation
ValueCountFrequency (%)
( 205
100.0%
Close Punctuation
ValueCountFrequency (%)
) 205
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1119300
91.3%
Common 107000
 
8.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 134836
12.0%
e 118916
 
10.6%
n 98990
 
8.8%
i 95166
 
8.5%
t 93258
 
8.3%
r 80021
 
7.1%
d 62882
 
5.6%
o 49354
 
4.4%
m 36443
 
3.3%
s 31433
 
2.8%
Other values (41) 318001
28.4%
Common
ValueCountFrequency (%)
103213
96.5%
. 2670
 
2.5%
, 685
 
0.6%
( 205
 
0.2%
) 205
 
0.2%
' 19
 
< 0.1%
- 3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1226286
> 99.9%
None 14
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 134836
 
11.0%
e 118916
 
9.7%
103213
 
8.4%
n 98990
 
8.1%
i 95166
 
7.8%
t 93258
 
7.6%
r 80021
 
6.5%
d 62882
 
5.1%
o 49354
 
4.0%
m 36443
 
3.0%
Other values (47) 353207
28.8%
None
ValueCountFrequency (%)
ô 14
100.0%

Currency
Text

MISSING 

Distinct144
Distinct (%)0.2%
Missing23850
Missing (%)26.7%
Memory size1.4 MiB

Length

Max length43
Median length33
Mean length19.35586372
Min length11

Characters and Unicode

Total characters1264596
Distinct characters55
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)< 0.1%

Sample

1st rowUSD United States dollar
2nd rowUSD United States dollar
3rd rowUSD United States dollar
4th rowPHP Philippine peso
5th rowGBP Pound sterling
ValueCountFrequency (%)
dollar 22053
 
10.2%
eur 17651
 
8.2%
european 17651
 
8.2%
euro 17651
 
8.2%
united 16894
 
7.8%
usd 16729
 
7.7%
states 16729
 
7.7%
pound 4629
 
2.1%
sterling 4473
 
2.1%
gbp 4473
 
2.1%
Other values (371) 77184
35.7%

Most occurring characters

ValueCountFrequency (%)
103293
 
8.2%
a 99699
 
7.9%
r 85503
 
6.8%
e 83279
 
6.6%
n 82752
 
6.5%
o 74795
 
5.9%
l 63316
 
5.0%
t 61234
 
4.8%
E 55187
 
4.4%
U 55086
 
4.4%
Other values (45) 500452
39.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 815062
64.5%
Uppercase Letter 298749
 
23.6%
Space Separator 103293
 
8.2%
Control 47490
 
3.8%
Final Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 99699
12.2%
r 85503
10.5%
e 83279
10.2%
n 82752
10.2%
o 74795
9.2%
l 63316
7.8%
t 61234
7.5%
d 54003
6.6%
i 53243
6.5%
u 51436
6.3%
Other values (16) 105802
13.0%
Uppercase Letter
ValueCountFrequency (%)
E 55187
18.5%
U 55086
18.4%
S 39940
13.4%
R 28951
9.7%
D 24982
8.4%
P 14698
 
4.9%
N 10313
 
3.5%
I 10002
 
3.3%
B 9827
 
3.3%
C 9348
 
3.1%
Other values (16) 40415
13.5%
Space Separator
ValueCountFrequency (%)
103293
100.0%
Control
ValueCountFrequency (%)
47490
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1113811
88.1%
Common 150785
 
11.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 99699
 
9.0%
r 85503
 
7.7%
e 83279
 
7.5%
n 82752
 
7.4%
o 74795
 
6.7%
l 63316
 
5.7%
t 61234
 
5.5%
E 55187
 
5.0%
U 55086
 
4.9%
d 54003
 
4.8%
Other values (42) 398957
35.8%
Common
ValueCountFrequency (%)
103293
68.5%
47490
31.5%
2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1264594
> 99.9%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
103293
 
8.2%
a 99699
 
7.9%
r 85503
 
6.8%
e 83279
 
6.6%
n 82752
 
6.5%
o 74795
 
5.9%
l 63316
 
5.0%
t 61234
 
4.8%
E 55187
 
4.4%
U 55086
 
4.4%
Other values (44) 500450
39.6%
Punctuation
ValueCountFrequency (%)
2
100.0%
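
Each Currency value combines a three-letter code with a descriptive name ("USD United States dollar"). The Control category count above suggests that some rows may separate the two with a tab rather than a space, although the report does not confirm this. The sketch below therefore splits on the first run of any whitespace. `split_currency` is a hypothetical helper.

```python
def split_currency(value):
    """Split 'CODE<whitespace>Name' into (code, name); return None if that shape is absent."""
    if value is None:
        return None
    # split(None, 1) splits once on any whitespace run, covering both tabs and spaces.
    parts = value.split(None, 1)
    if len(parts) != 2:
        return None
    code, name = parts
    return code, name.strip()


pairs = [split_currency(v) for v in ["USD United States dollar", "GBP\tPound sterling"]]
```

Keeping only the code gives a compact key for grouping compensation figures by currency.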

CompTotal
Real number (ℝ)

MISSING  SKEWED 

Distinct3828
Distinct (%)7.9%
Missing40959
Missing (%)45.9%
Infinite0
Infinite (%)0.0%
Mean1.036806636 × 10^42
Minimum0
Maximum5 × 10^46
Zeros130
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile25920
Q163000
median115000
Q3230000
95-th percentile3000000
Maximum5 × 10^46
Range5 × 10^46
Interquartile range (IQR)167000

Descriptive statistics

Standard deviation2.276847201 × 10^44
Coefficient of variation (CV)219.6019126
Kurtosis48225
Mean1.036806636 × 10^42
Median Absolute Deviation (MAD)65000
Skewness219.6019126
Sum5 × 10^46
Variance5.184033178 × 10^88
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100000 1355
 
1.5%
60000 1290
 
1.4%
120000 1240
 
1.4%
150000 1150
 
1.3%
80000 1066
 
1.2%
70000 997
 
1.1%
200000 991
 
1.1%
50000 954
 
1.1%
90000 862
 
1.0%
130000 711
 
0.8%
Other values (3818) 37609
42.2%
(Missing) 40959
45.9%
ValueCountFrequency (%)
0 130
0.1%
1 9
 
< 0.1%
2 1
 
< 0.1%
3 5
 
< 0.1%
4 3
 
< 0.1%
ValueCountFrequency (%)
5 × 10^46 1
< 0.1%
1 × 10^21 1
< 0.1%
1 × 10^16 2
< 0.1%
1 × 10^15 1
< 0.1%
4.57 × 10^12 1
< 0.1%
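
The CompTotal mean, standard deviation and sum are dominated by a single entry of 5 × 10^46, while the median and MAD stay in a plausible range. A minimal sketch of summarising such a column with outlier-resistant statistics follows. The helper `robust_summary`, the 0.95 cut-off and the toy input are assumptions for illustration, not the method used to produce this report.

```python
import statistics


def robust_summary(values, upper_quantile=0.95):
    """Summarise numbers with statistics that one extreme value cannot dominate."""
    clean = sorted(v for v in values if v is not None)
    median = statistics.median(clean)
    # Median absolute deviation: median distance from the median.
    mad = statistics.median(abs(v - median) for v in clean)
    # Simple quantile approximation: the value at position floor(q * (n - 1)).
    cap = clean[int(upper_quantile * (len(clean) - 1))]
    # Mean after clipping every value above the cap down to the cap.
    clipped_mean = statistics.fmean(min(v, cap) for v in clean)
    return {"median": median, "mad": mad, "cap": cap, "clipped_mean": clipped_mean}


# Toy data: four plausible salaries, one missing value, one extreme entry.
comp = [60000, 100000, 120000, 150000, None, 5e46]
summary = robust_summary(comp)
```

Because the values are reported in many different currencies, any such summary is only meaningful within a single currency or after conversion.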

LanguageHaveWorkedWith
Text

MISSING 

Distinct32641
Distinct (%)37.5%
Missing2044
Missing (%)2.3%
Memory size1.4 MiB

Length

Max length340
Median length205
Mean length41.30784944
Min length1

Characters and Unicode

Total characters3599566
Distinct characters54
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25997 ?
Unique (%)29.8%

Sample

1st rowHTML/CSS;JavaScript;Python
2nd rowBash/Shell (all shells);Go
3rd rowBash/Shell (all shells);HTML/CSS;JavaScript;PHP;Ruby;SQL;TypeScript
4th rowHTML/CSS;JavaScript;TypeScript
5th rowBash/Shell (all shells);HTML/CSS;JavaScript;Ruby;SQL;TypeScript
ValueCountFrequency (%)
all 28351
 
18.8%
bash/shell 24935
 
16.5%
basic 3568
 
2.4%
net 3508
 
2.3%
assembly;bash/shell 2928
 
1.9%
html/css;javascript;typescript 1487
 
1.0%
python 1132
 
0.7%
c 976
 
0.6%
html/css;javascript 735
 
0.5%
html/css;javascript;php;sql 718
 
0.5%
Other values (31964) 82640
54.7%

Most occurring characters

ValueCountFrequency (%)
; 379017
 
10.5%
S 275431
 
7.7%
a 250603
 
7.0%
l 229976
 
6.4%
t 171089
 
4.8%
h 142944
 
4.0%
e 135401
 
3.8%
p 130040
 
3.6%
i 125044
 
3.5%
r 120381
 
3.3%
Other values (44) 1639640
45.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1918925
53.3%
Uppercase Letter 1029304
28.6%
Other Punctuation 482374
 
13.4%
Space Separator 63838
 
1.8%
Math Symbol 39268
 
1.1%
Open Punctuation 31919
 
0.9%
Close Punctuation 31919
 
0.9%
Dash Punctuation 2019
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 250603
13.1%
l 229976
12.0%
t 171089
8.9%
h 142944
 
7.4%
e 135401
 
7.1%
p 130040
 
6.8%
i 125044
 
6.5%
r 120381
 
6.3%
s 116682
 
6.1%
c 99256
 
5.2%
Other values (14) 397509
20.7%
Uppercase Letter
ValueCountFrequency (%)
S 275431
26.8%
C 111866
10.9%
L 99261
 
9.6%
P 90760
 
8.8%
T 83776
 
8.1%
J 83478
 
8.1%
H 64499
 
6.3%
M 49735
 
4.8%
Q 42623
 
4.1%
B 38365
 
3.7%
Other values (11) 89510
 
8.7%
Other Punctuation
ValueCountFrequency (%)
; 379017
78.6%
/ 74747
 
15.5%
# 25042
 
5.2%
. 3568
 
0.7%
Space Separator
ValueCountFrequency (%)
63838
100.0%
Math Symbol
ValueCountFrequency (%)
+ 39268
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31919
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31919
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2019
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2948229
81.9%
Common 651337
 
18.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 275431
 
9.3%
a 250603
 
8.5%
l 229976
 
7.8%
t 171089
 
5.8%
h 142944
 
4.8%
e 135401
 
4.6%
p 130040
 
4.4%
i 125044
 
4.2%
r 120381
 
4.1%
s 116682
 
4.0%
Other values (35) 1250638
42.4%
Common
ValueCountFrequency (%)
; 379017
58.2%
/ 74747
 
11.5%
63838
 
9.8%
+ 39268
 
6.0%
( 31919
 
4.9%
) 31919
 
4.9%
# 25042
 
3.8%
. 3568
 
0.5%
- 2019
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3599566
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 379017
 
10.5%
S 275431
 
7.7%
a 250603
 
7.0%
l 229976
 
6.4%
t 171089
 
4.8%
h 142944
 
4.0%
e 135401
 
3.8%
p 130040
 
3.6%
i 125044
 
3.5%
r 120381
 
3.3%
Other values (44) 1639640
45.6%
LanguageWantToWorkWith
Distinct: 29602
Distinct (%): 36.7%
Missing: 8475
Missing (%): 9.5%
Memory size: 1.4 MiB

Length

Max length: 340
Median length: 271
Mean length: 33.63966844
Min length: 1

Characters and Unicode

Total characters: 2715024
Distinct characters: 54
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23574
Unique (%): 29.2%

Sample

1st row: Bash/Shell (all shells);C#;Dart;Elixir;GDScript;HTML/CSS;JavaScript;Rust
2nd row: Haskell;OCaml;Rust
3rd row: Bash/Shell (all shells);HTML/CSS;JavaScript;Ruby;TypeScript
4th row: HTML/CSS;JavaScript;Python;Rust;TypeScript
5th row: Go;Rust
ValueCountFrequency (%)
all 18279
 
15.2%
bash/shell 16027
 
13.4%
assembly;bash/shell 1842
 
1.5%
rust 1438
 
1.2%
basic 1350
 
1.1%
net 1262
 
1.1%
python 1182
 
1.0%
c 1168
 
1.0%
html/css;javascript;typescript 1071
 
0.9%
go 618
 
0.5%
Other values (29208) 75730
63.1%

Most occurring characters

ValueCountFrequency (%)
; 293515
 
10.8%
S 193146
 
7.1%
l 164015
 
6.0%
a 163765
 
6.0%
t 157241
 
5.8%
i 109655
 
4.0%
p 105197
 
3.9%
h 97642
 
3.6%
s 97571
 
3.6%
e 96963
 
3.6%
Other values (44) 1236314
45.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1505661
55.5%
Uppercase Letter 737665
27.2%
Other Punctuation 363766
 
13.4%
Space Separator 39258
 
1.4%
Math Symbol 28462
 
1.0%
Close Punctuation 19629
 
0.7%
Open Punctuation 19629
 
0.7%
Dash Punctuation 954
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 164015
10.9%
a 163765
10.9%
t 157241
10.4%
i 109655
 
7.3%
p 105197
 
7.0%
h 97642
 
6.5%
s 97571
 
6.5%
e 96963
 
6.4%
r 92683
 
6.2%
y 78511
 
5.2%
Other values (14) 342418
22.7%
Uppercase Letter
ValueCountFrequency (%)
S 193146
26.2%
C 77722
10.5%
L 66859
 
9.1%
T 63323
 
8.6%
P 58928
 
8.0%
J 51554
 
7.0%
H 41302
 
5.6%
R 34350
 
4.7%
M 31067
 
4.2%
Q 29598
 
4.0%
Other values (11) 89816
12.2%
Other Punctuation
ValueCountFrequency (%)
; 293515
80.7%
/ 48207
 
13.3%
# 20694
 
5.7%
. 1350
 
0.4%
Space Separator
ValueCountFrequency (%)
39258
100.0%
Math Symbol
ValueCountFrequency (%)
+ 28462
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19629
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19629
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 954
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2243326
82.6%
Common 471698
 
17.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 193146
 
8.6%
l 164015
 
7.3%
a 163765
 
7.3%
t 157241
 
7.0%
i 109655
 
4.9%
p 105197
 
4.7%
h 97642
 
4.4%
s 97571
 
4.3%
e 96963
 
4.3%
r 92683
 
4.1%
Other values (35) 965448
43.0%
Common
ValueCountFrequency (%)
; 293515
62.2%
/ 48207
 
10.2%
39258
 
8.3%
+ 28462
 
6.0%
# 20694
 
4.4%
) 19629
 
4.2%
( 19629
 
4.2%
. 1350
 
0.3%
- 954
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2715024
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 293515
 
10.8%
S 193146
 
7.1%
l 164015
 
6.0%
a 163765
 
6.0%
t 157241
 
5.8%
i 109655
 
4.0%
p 105197
 
3.9%
h 97642
 
3.6%
s 97571
 
3.6%
e 96963
 
3.6%
Other values (44) 1236314
45.5%
DatabaseHaveWorkedWith
Distinct: 11096
Distinct (%): 15.1%
Missing: 15749
Missing (%): 17.7%
Memory size: 1.4 MiB

Length

Max length: 314
Median length: 216
Mean length: 29.63427521
Min length: 2

Characters and Unicode

Total characters: 2176193
Distinct characters: 45
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7608
Unique (%): 10.4%

Sample

1st row: Supabase
2nd row: PostgreSQL;Redis
3rd row: BigQuery;Elasticsearch;MongoDB;PostgreSQL
4th row: BigQuery;Cloud Firestore;PostgreSQL;Redis
5th row: MariaDB;Microsoft SQL Server;MySQL;PostgreSQL;SQLite
ValueCountFrequency (%)
sql 19506
 
14.4%
microsoft 10648
 
7.9%
realtime 4939
 
3.6%
server 4568
 
3.4%
postgresql 4350
 
3.2%
cloud 4153
 
3.1%
mysql 3116
 
2.3%
access;microsoft 2357
 
1.7%
sqlite 2299
 
1.7%
cosmos 2205
 
1.6%
Other values (6377) 77315
57.1%

Most occurring characters

ValueCountFrequency (%)
e 180418
 
8.3%
r 152361
 
7.0%
o 150100
 
6.9%
; 148811
 
6.8%
S 134219
 
6.2%
s 127897
 
5.9%
Q 113038
 
5.2%
L 109582
 
5.0%
i 107627
 
4.9%
t 106636
 
4.9%
Other values (35) 845504
38.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1305199
60.0%
Uppercase Letter 654508
30.1%
Other Punctuation 148811
 
6.8%
Space Separator 62021
 
2.8%
Decimal Number 5654
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 180418
13.8%
r 152361
11.7%
o 150100
11.5%
s 127897
9.8%
i 107627
8.2%
t 106636
8.2%
a 100225
7.7%
c 61947
 
4.7%
g 57921
 
4.4%
y 41743
 
3.2%
Other values (13) 218324
16.7%
Uppercase Letter
ValueCountFrequency (%)
S 134219
20.5%
Q 113038
17.3%
L 109582
16.7%
M 88717
13.6%
D 53418
 
8.2%
B 45838
 
7.0%
P 34909
 
5.3%
R 20805
 
3.2%
C 12629
 
1.9%
F 10992
 
1.7%
Other values (8) 30361
 
4.6%
Decimal Number
ValueCountFrequency (%)
2 4222
74.7%
4 1432
 
25.3%
Other Punctuation
ValueCountFrequency (%)
; 148811
100.0%
Space Separator
ValueCountFrequency (%)
62021
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1959707
90.1%
Common 216486
 
9.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 180418
 
9.2%
r 152361
 
7.8%
o 150100
 
7.7%
S 134219
 
6.8%
s 127897
 
6.5%
Q 113038
 
5.8%
L 109582
 
5.6%
i 107627
 
5.5%
t 106636
 
5.4%
a 100225
 
5.1%
Other values (31) 677604
34.6%
Common
ValueCountFrequency (%)
; 148811
68.7%
62021
28.6%
2 4222
 
2.0%
4 1432
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2176193
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 180418
 
8.3%
r 152361
 
7.0%
o 150100
 
6.9%
; 148811
 
6.8%
S 134219
 
6.2%
s 127897
 
5.9%
Q 113038
 
5.2%
L 109582
 
5.0%
i 107627
 
4.9%
t 106636
 
4.9%
Other values (35) 845504
38.9%
DatabaseWantToWorkWith
Distinct: 10485
Distinct (%): 17.2%
Missing: 28273
Missing (%): 31.7%
Memory size: 1.4 MiB

Length

Max length: 314
Median length: 268
Mean length: 28.33108962
Min length: 2

Characters and Unicode

Total characters: 1725675
Distinct characters: 45
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7436
Unique (%): 12.2%

Sample

1st row: Firebase Realtime Database;Supabase
2nd row: PostgreSQL;Redis
3rd row: Elasticsearch;MongoDB;PostgreSQL;Redis;Supabase
4th row: Datomic
5th row: BigQuery;Cockroachdb;DuckDB;Elasticsearch;MongoDB;Neo4J;PostgreSQL;Redis;Snowflake;SQLite
ValueCountFrequency (%)
sql 11611
 
11.5%
microsoft 6276
 
6.2%
postgresql 4815
 
4.8%
realtime 4159
 
4.1%
server 3010
 
3.0%
cloud 2729
 
2.7%
cosmos 2008
 
2.0%
sqlite 1771
 
1.8%
mysql 1754
 
1.7%
postgresql;sqlite 1631
 
1.6%
Other values (7105) 61421
60.7%

Most occurring characters

ValueCountFrequency (%)
e 146314
 
8.5%
o 120638
 
7.0%
; 117286
 
6.8%
s 112781
 
6.5%
r 112140
 
6.5%
S 96528
 
5.6%
a 90667
 
5.3%
i 84112
 
4.9%
t 83311
 
4.8%
Q 81506
 
4.7%
Other values (35) 680392
39.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1072586
62.2%
Uppercase Letter 491276
28.5%
Other Punctuation 117286
 
6.8%
Space Separator 40274
 
2.3%
Decimal Number 4253
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 146314
13.6%
o 120638
11.2%
s 112781
10.5%
r 112140
10.5%
a 90667
8.5%
i 84112
7.8%
t 83311
7.8%
g 51726
 
4.8%
c 45566
 
4.2%
d 32634
 
3.0%
Other values (13) 192697
18.0%
Uppercase Letter
ValueCountFrequency (%)
S 96528
19.6%
Q 81506
16.6%
L 77863
15.8%
M 55610
11.3%
D 43868
8.9%
B 36605
 
7.5%
P 31550
 
6.4%
R 21425
 
4.4%
C 14769
 
3.0%
E 9744
 
2.0%
Other values (8) 21808
 
4.4%
Decimal Number
ValueCountFrequency (%)
4 2377
55.9%
2 1876
44.1%
Other Punctuation
ValueCountFrequency (%)
; 117286
100.0%
Space Separator
ValueCountFrequency (%)
40274
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1563862
90.6%
Common 161813
 
9.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 146314
 
9.4%
o 120638
 
7.7%
s 112781
 
7.2%
r 112140
 
7.2%
S 96528
 
6.2%
a 90667
 
5.8%
i 84112
 
5.4%
t 83311
 
5.3%
Q 81506
 
5.2%
L 77863
 
5.0%
Other values (31) 558002
35.7%
Common
ValueCountFrequency (%)
; 117286
72.5%
40274
 
24.9%
4 2377
 
1.5%
2 1876
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1725675
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 146314
 
8.5%
o 120638
 
7.0%
; 117286
 
6.8%
s 112781
 
6.5%
r 112140
 
6.5%
S 96528
 
5.6%
a 90667
 
5.3%
i 84112
 
4.9%
t 83311
 
4.8%
Q 81506
 
4.7%
Other values (35) 680392
39.4%
PlatformHaveWorkedWith
Distinct: 5920
Distinct (%): 9.3%
Missing: 25556
Missing (%): 28.7%
Memory size: 1.4 MiB

Length

Max length: 278
Median length: 198
Mean length: 33.46091343
Min length: 3

Characters and Unicode

Total characters: 2129051
Distinct characters: 45
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3729
Unique (%): 5.9%

Sample

1st row: Amazon Web Services (AWS);Netlify;Vercel
2nd row: Amazon Web Services (AWS);Google Cloud;OpenStack;VMware;Vultr
3rd row: Cloudflare;Heroku
4th row: Amazon Web Services (AWS);Firebase;Heroku;Netlify;Vercel
5th row: Amazon Web Services (AWS);Cloudflare;Google Cloud
ValueCountFrequency (%)
amazon 33818
15.0%
services 33818
15.0%
web 33818
15.0%
azure 13198
 
5.9%
aws 9804
 
4.4%
cloud 9242
 
4.1%
microsoft 6764
 
3.0%
google 4250
 
1.9%
aws);google 4159
 
1.9%
aws);digital 3211
 
1.4%
Other values (1911) 72718
32.3%

Most occurring characters

ValueCountFrequency (%)
e 232824
 
10.9%
161172
 
7.6%
o 153202
 
7.2%
r 125895
 
5.9%
i 99216
 
4.7%
a 96258
 
4.5%
A 88496
 
4.2%
l 85892
 
4.0%
; 83884
 
3.9%
c 74474
 
3.5%
Other values (35) 927738
43.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1417081
66.6%
Uppercase Letter 391254
 
18.4%
Space Separator 161172
 
7.6%
Other Punctuation 88288
 
4.1%
Close Punctuation 35628
 
1.7%
Open Punctuation 35628
 
1.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 232824
16.4%
o 153202
10.8%
r 125895
 
8.9%
i 99216
 
7.0%
a 96258
 
6.8%
l 85892
 
6.1%
c 74474
 
5.3%
s 67674
 
4.8%
n 63596
 
4.5%
u 61242
 
4.3%
Other values (13) 356808
25.2%
Uppercase Letter
ValueCountFrequency (%)
A 88496
22.6%
S 71011
18.1%
W 68437
17.5%
C 32109
 
8.2%
M 26249
 
6.7%
O 18845
 
4.8%
G 16592
 
4.2%
H 16184
 
4.1%
V 16130
 
4.1%
F 12410
 
3.2%
Other values (6) 24791
 
6.3%
Other Punctuation
ValueCountFrequency (%)
; 83884
95.0%
, 2755
 
3.1%
. 1649
 
1.9%
Space Separator
ValueCountFrequency (%)
161172
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35628
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35628
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1808335
84.9%
Common 320716
 
15.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 232824
 
12.9%
o 153202
 
8.5%
r 125895
 
7.0%
i 99216
 
5.5%
a 96258
 
5.3%
A 88496
 
4.9%
l 85892
 
4.7%
c 74474
 
4.1%
S 71011
 
3.9%
W 68437
 
3.8%
Other values (29) 712630
39.4%
Common
ValueCountFrequency (%)
161172
50.3%
; 83884
26.2%
) 35628
 
11.1%
( 35628
 
11.1%
, 2755
 
0.9%
. 1649
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2129051
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 232824
 
10.9%
161172
 
7.6%
o 153202
 
7.2%
r 125895
 
5.9%
i 99216
 
4.7%
a 96258
 
4.5%
A 88496
 
4.2%
l 85892
 
4.0%
; 83884
 
3.9%
c 74474
 
3.5%
Other values (35) 927738
43.6%
PlatformWantToWorkWith
Distinct: 4963
Distinct (%): 9.7%
Missing: 37876
Missing (%): 42.5%
Memory size: 1.4 MiB

Length

Max length: 278
Median length: 239
Mean length: 33.79439853
Min length: 3

Characters and Unicode

Total characters: 1733923
Distinct characters: 45
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3130
Unique (%): 6.1%

Sample

1st row: Fly.io;Netlify;Render
2nd row: Cloudflare;Heroku
3rd row: Amazon Web Services (AWS);Cloudflare;Digital Ocean;Netlify;Vercel
4th row: Vercel
5th row: Amazon Web Services (AWS);Digital Ocean;Fly.io;Linode, now Akamai;Vercel
ValueCountFrequency (%)
amazon 27311
14.9%
web 27311
14.9%
services 27311
14.9%
azure 10691
 
5.8%
cloud 7870
 
4.3%
aws 7269
 
4.0%
microsoft 4845
 
2.6%
aws);google 4083
 
2.2%
cloud;microsoft 3206
 
1.8%
google 3099
 
1.7%
Other values (1763) 59849
32.7%

Most occurring characters

ValueCountFrequency (%)
e 189364
 
10.9%
131537
 
7.6%
o 125496
 
7.2%
r 100481
 
5.8%
i 80861
 
4.7%
a 77749
 
4.5%
l 76202
 
4.4%
A 72118
 
4.2%
; 67806
 
3.9%
c 62570
 
3.6%
Other values (35) 749739
43.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1156254
66.7%
Uppercase Letter 315259
 
18.2%
Space Separator 131537
 
7.6%
Other Punctuation 73241
 
4.2%
Open Punctuation 28816
 
1.7%
Close Punctuation 28816
 
1.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 189364
16.4%
o 125496
10.9%
r 100481
 
8.7%
i 80861
 
7.0%
a 77749
 
6.7%
l 76202
 
6.6%
c 62570
 
5.4%
s 54116
 
4.7%
n 52417
 
4.5%
u 47775
 
4.1%
Other values (13) 289223
25.0%
Uppercase Letter
ValueCountFrequency (%)
A 72118
22.9%
S 57852
18.4%
W 55339
17.6%
C 28111
 
8.9%
M 19571
 
6.2%
O 15038
 
4.8%
G 13972
 
4.4%
V 13444
 
4.3%
F 10992
 
3.5%
H 8731
 
2.8%
Other values (6) 20091
 
6.4%
Other Punctuation
ValueCountFrequency (%)
; 67806
92.6%
, 2807
 
3.8%
. 2628
 
3.6%
Space Separator
ValueCountFrequency (%)
131537
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28816
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28816
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1471513
84.9%
Common 262410
 
15.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 189364
 
12.9%
o 125496
 
8.5%
r 100481
 
6.8%
i 80861
 
5.5%
a 77749
 
5.3%
l 76202
 
5.2%
A 72118
 
4.9%
c 62570
 
4.3%
S 57852
 
3.9%
W 55339
 
3.8%
Other values (29) 573481
39.0%
Common
ValueCountFrequency (%)
131537
50.1%
; 67806
25.8%
( 28816
 
11.0%
) 28816
 
11.0%
, 2807
 
1.1%
. 2628
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1733923
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 189364
 
10.9%
131537
 
7.6%
o 125496
 
7.2%
r 100481
 
5.8%
i 80861
 
4.7%
a 77749
 
4.5%
l 76202
 
4.4%
A 72118
 
4.2%
; 67806
 
3.9%
c 62570
 
3.6%
Other values (35) 749739
43.2%
WebframeHaveWorkedWith
Distinct: 15144
Distinct (%): 22.6%
Missing: 22246
Missing (%): 24.9%
Memory size: 1.4 MiB

Length

Max length: 274
Median length: 220
Mean length: 26.92986047
Min length: 3

Characters and Unicode

Total characters: 1802631
Distinct characters: 47
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11187
Unique (%): 16.7%

Sample

1st row: Next.js;React;Remix;Vue.js
2nd row: Node.js;React;Ruby on Rails;Vue.js;WordPress
3rd row: Express;Gatsby;NestJS;Next.js;Node.js;React
4th row: Angular;Express;NestJS;Node.js
5th row: AngularJS;jQuery;Node.js;Phoenix;Ruby on Rails;Solid.js;Svelte;Vue.js
ValueCountFrequency (%)
boot 6221
 
6.5%
asp.net;asp.net 4084
 
4.3%
on 3940
 
4.1%
asp.net 3869
 
4.0%
rails 2343
 
2.4%
core 2243
 
2.3%
react 2017
 
2.1%
spring 1671
 
1.7%
angular;asp.net;asp.net 1529
 
1.6%
node.js 1452
 
1.5%
Other values (12923) 66473
69.4%

Most occurring characters

ValueCountFrequency (%)
; 164518
 
9.1%
e 148449
 
8.2%
s 129710
 
7.2%
a 93050
 
5.2%
r 87743
 
4.9%
j 82009
 
4.5%
o 81303
 
4.5%
. 79068
 
4.4%
t 70980
 
3.9%
N 70012
 
3.9%
Other values (37) 795789
44.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1088097
60.4%
Uppercase Letter 442044
24.5%
Other Punctuation 243586
 
13.5%
Space Separator 28904
 
1.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 148449
13.6%
s 129710
11.9%
a 93050
 
8.6%
r 87743
 
8.1%
j 82009
 
7.5%
o 81303
 
7.5%
t 70980
 
6.5%
u 53190
 
4.9%
l 47528
 
4.4%
n 45178
 
4.2%
Other values (15) 248957
22.9%
Uppercase Letter
ValueCountFrequency (%)
N 70012
15.8%
R 49825
11.3%
E 47400
10.7%
S 46554
10.5%
A 44119
10.0%
P 38023
8.6%
T 21081
 
4.8%
Q 16173
 
3.7%
F 16078
 
3.6%
C 13134
 
3.0%
Other values (9) 79645
18.0%
Other Punctuation
ValueCountFrequency (%)
; 164518
67.5%
. 79068
32.5%
Space Separator
ValueCountFrequency (%)
28904
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1530141
84.9%
Common 272490
 
15.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 148449
 
9.7%
s 129710
 
8.5%
a 93050
 
6.1%
r 87743
 
5.7%
j 82009
 
5.4%
o 81303
 
5.3%
t 70980
 
4.6%
N 70012
 
4.6%
u 53190
 
3.5%
R 49825
 
3.3%
Other values (34) 663870
43.4%
Common
ValueCountFrequency (%)
; 164518
60.4%
. 79068
29.0%
28904
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1802631
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 164518
 
9.1%
e 148449
 
8.2%
s 129710
 
7.2%
a 93050
 
5.2%
r 87743
 
4.9%
j 82009
 
4.5%
o 81303
 
4.5%
. 79068
 
4.4%
t 70980
 
3.9%
N 70012
 
3.9%
Other values (37) 795789
44.1%
WebframeWantToWorkWith
Distinct: 14620
Distinct (%): 25.8%
Missing: 32443
Missing (%): 36.4%
Memory size: 1.4 MiB

Length

Max length: 274
Median length: 233
Mean length: 26.21319681
Min length: 3

Characters and Unicode

Total characters: 1487363
Distinct characters: 47
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11055
Unique (%): 19.5%

Sample

1st row: Deno;Elm;Nuxt.js;React;Svelte;Vue.js
2nd row: Node.js;Ruby on Rails;Vue.js
3rd row: Express;NestJS;Next.js;Node.js;React;Remix;Vue.js
4th row: Deno;FastAPI;Fastify;Flask;NestJS;Next.js;Node.js;Phoenix;React;Remix
5th row: ASP.NET CORE;Qwik;Ruby on Rails;Svelte
ValueCountFrequency (%)
asp.net 4984
 
6.2%
boot 4501
 
5.6%
on 3501
 
4.3%
asp.net;asp.net 2140
 
2.7%
rails 1850
 
2.3%
core 1666
 
2.1%
angular;asp.net 1655
 
2.1%
react 1561
 
1.9%
spring 1425
 
1.8%
core;blazor 1224
 
1.5%
Other values (12946) 56221
69.6%

Most occurring characters

ValueCountFrequency (%)
; 138485
 
9.3%
e 130515
 
8.8%
s 103893
 
7.0%
a 74491
 
5.0%
t 71841
 
4.8%
. 70463
 
4.7%
j 68779
 
4.6%
o 68086
 
4.6%
N 59137
 
4.0%
r 51694
 
3.5%
Other values (37) 649979
43.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 900516
60.5%
Uppercase Letter 353912
 
23.8%
Other Punctuation 208948
 
14.0%
Space Separator 23987
 
1.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 130515
14.5%
s 103893
11.5%
a 74491
 
8.3%
t 71841
 
8.0%
j 68779
 
7.6%
o 68086
 
7.6%
r 51694
 
5.7%
l 44735
 
5.0%
n 38183
 
4.2%
u 37266
 
4.1%
Other values (15) 211033
23.4%
Uppercase Letter
ValueCountFrequency (%)
N 59137
16.7%
R 43411
12.3%
S 42665
12.1%
E 34662
9.8%
A 32384
9.2%
P 26934
7.6%
T 14467
 
4.1%
F 13407
 
3.8%
D 12311
 
3.5%
V 12001
 
3.4%
Other values (9) 62533
17.7%
Other Punctuation
ValueCountFrequency (%)
; 138485
66.3%
. 70463
33.7%
Space Separator
ValueCountFrequency (%)
23987
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1254428
84.3%
Common 232935
 
15.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 130515
 
10.4%
s 103893
 
8.3%
a 74491
 
5.9%
t 71841
 
5.7%
j 68779
 
5.5%
o 68086
 
5.4%
N 59137
 
4.7%
r 51694
 
4.1%
l 44735
 
3.6%
R 43411
 
3.5%
Other values (34) 537846
42.9%
Common
ValueCountFrequency (%)
; 138485
59.5%
. 70463
30.3%
23987
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1487363
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 138485
 
9.3%
e 130515
 
8.8%
s 103893
 
7.0%
a 74491
 
5.0%
t 71841
 
4.8%
. 70463
 
4.7%
j 68779
 
4.6%
o 68086
 
4.6%
N 59137
 
4.0%
r 51694
 
3.5%
Other values (37) 649979
43.7%
MiscTechHaveWorkedWith
Distinct: 10322
Distinct (%): 18.1%
Missing: 32165
Missing (%): 36.1%
Memory size: 1.4 MiB

Length

Max length: 345
Median length: 245
Mean length: 28.57640436
Min length: 2

Characters and Unicode

Total characters: 1629398
Distinct characters: 56
Distinct categories: 9
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7686
Unique (%): 13.5%

Sample

1st row: Electron;React Native;Tauri
2nd row: RabbitMQ;Spring Framework
3rd row: NumPy;Pandas;Scikit-Learn;Tauri;TensorFlow
4th row: .NET (5+)
5th row: .NET (5+) ;.NET Framework (1.0 - 4.8)
ValueCountFrequency (%)
net 29026
18.1%
framework 17661
 
11.0%
5 17005
 
10.6%
1.0 11452
 
7.1%
11452
 
7.1%
apache 6717
 
4.2%
4.8 5336
 
3.3%
native 3497
 
2.2%
spring 2393
 
1.5%
react 2049
 
1.3%
Other values (7802) 54079
33.7%

Most occurring characters

ValueCountFrequency (%)
a 111982
 
6.9%
107311
 
6.6%
r 101594
 
6.2%
; 98850
 
6.1%
e 81291
 
5.0%
o 61307
 
3.8%
n 56447
 
3.5%
T 54306
 
3.3%
. 52934
 
3.2%
t 51273
 
3.1%
Other values (46) 852103
52.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 868549
53.3%
Uppercase Letter 341347
 
20.9%
Other Punctuation 157668
 
9.7%
Space Separator 107311
 
6.6%
Decimal Number 62813
 
3.9%
Close Punctuation 28457
 
1.7%
Open Punctuation 28457
 
1.7%
Dash Punctuation 17791
 
1.1%
Math Symbol 17005
 
1.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 111982
12.9%
r 101594
11.7%
e 81291
 
9.4%
o 61307
 
7.1%
n 56447
 
6.5%
t 51273
 
5.9%
c 48014
 
5.5%
i 45552
 
5.2%
m 36968
 
4.3%
k 34746
 
4.0%
Other values (12) 239375
27.6%
Uppercase Letter
ValueCountFrequency (%)
T 54306
15.9%
N 49311
14.4%
E 34715
10.2%
F 33972
10.0%
P 32611
9.6%
S 19190
 
5.6%
A 13988
 
4.1%
R 12609
 
3.7%
Q 12106
 
3.5%
K 11253
 
3.3%
Other values (11) 67286
19.7%
Decimal Number
ValueCountFrequency (%)
5 17005
27.1%
1 11452
18.2%
4 11452
18.2%
8 11452
18.2%
0 11452
18.2%
Other Punctuation
ValueCountFrequency (%)
; 98850
62.7%
. 52934
33.6%
/ 5884
 
3.7%
Space Separator
ValueCountFrequency (%)
107311
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28457
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28457
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17791
100.0%
Math Symbol
ValueCountFrequency (%)
+ 17005
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1209896
74.3%
Common 419502
 
25.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 111982
 
9.3%
r 101594
 
8.4%
e 81291
 
6.7%
o 61307
 
5.1%
n 56447
 
4.7%
T 54306
 
4.5%
t 51273
 
4.2%
N 49311
 
4.1%
c 48014
 
4.0%
i 45552
 
3.8%
Other values (33) 548819
45.4%
Common
ValueCountFrequency (%)
107311
25.6%
; 98850
23.6%
. 52934
12.6%
) 28457
 
6.8%
( 28457
 
6.8%
- 17791
 
4.2%
5 17005
 
4.1%
+ 17005
 
4.1%
1 11452
 
2.7%
4 11452
 
2.7%
Other values (3) 28788
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1629398
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 111982
 
6.9%
107311
 
6.6%
r 101594
 
6.2%
; 98850
 
6.1%
e 81291
 
5.0%
o 61307
 
3.8%
n 56447
 
3.5%
T 54306
 
3.3%
. 52934
 
3.2%
t 51273
 
3.1%
Other values (46) 852103
52.3%
MiscTechWantToWorkWith
Distinct: 11775
Distinct (%): 25.1%
Missing: 42336
Missing (%): 47.5%
Memory size: 1.4 MiB

Length

Max length: 345
Median length: 243
Mean length: 31.13881062
Min length: 2

Characters and Unicode

Total characters: 1458791
Distinct characters: 56
Distinct categories: 9
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9043
Unique (%): 19.3%

Sample

1st row: Capacitor;Electron;Tauri;Uno Platform;Xamarin
2nd row: .NET MAUI;Apache Kafka;Apache Spark;Hugging Face Transformers;Ionic;JAX;NumPy;Pandas;Scikit-Learn;Tauri;TensorFlow;Tidyverse;Torch/PyTorch
3rd row: .NET (5+)
4th row: .NET (5+) ;Flutter
5th row: Apache Kafka;Flutter;RabbitMQ;Spring Framework
ValueCountFrequency (%)
net 20624
 
17.3%
5 13889
 
11.7%
framework 7690
 
6.5%
apache 7569
 
6.4%
1.0 3854
 
3.2%
3854
 
3.2%
face 3186
 
2.7%
native 2704
 
2.3%
kafka;apache 2154
 
1.8%
react 1579
 
1.3%
Other values (8395) 51799
43.6%

Most occurring characters

ValueCountFrequency (%)
a 101778
 
7.0%
; 99494
 
6.8%
r 92412
 
6.3%
e 78463
 
5.4%
75636
 
5.2%
o 60447
 
4.1%
T 56070
 
3.8%
c 53475
 
3.7%
n 53429
 
3.7%
t 52974
 
3.6%
Other values (46) 734613
50.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 832799
57.1%
Uppercase Letter 325278
 
22.3%
Other Punctuation 137053
 
9.4%
Space Separator 75636
 
5.2%
Decimal Number 29305
 
2.0%
Close Punctuation 17743
 
1.2%
Open Punctuation 17743
 
1.2%
Math Symbol 13889
 
1.0%
Dash Punctuation 9345
 
0.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 101778
12.2%
r 92412
 
11.1%
e 78463
 
9.4%
o 60447
 
7.3%
c 53475
 
6.4%
n 53429
 
6.4%
t 52974
 
6.4%
i 43585
 
5.2%
p 31154
 
3.7%
s 30062
 
3.6%
Other values (12) 235020
28.2%
Uppercase Letter
ValueCountFrequency (%)
T 56070
17.2%
N 38096
11.7%
F 29285
 
9.0%
P 28925
 
8.9%
E 25670
 
7.9%
A 18942
 
5.8%
S 17528
 
5.4%
K 12551
 
3.9%
R 12474
 
3.8%
U 11754
 
3.6%
Other values (11) 73983
22.7%
Decimal Number
ValueCountFrequency (%)
5 13889
47.4%
8 3854
 
13.2%
4 3854
 
13.2%
0 3854
 
13.2%
1 3854
 
13.2%
Other Punctuation
ValueCountFrequency (%)
; 99494
72.6%
. 29283
 
21.4%
/ 8276
 
6.0%
Space Separator
ValueCountFrequency (%)
75636
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17743
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17743
100.0%
Math Symbol
ValueCountFrequency (%)
+ 13889
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9345
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1158077
79.4%
Common 300714
 
20.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 101778
 
8.8%
r 92412
 
8.0%
e 78463
 
6.8%
o 60447
 
5.2%
T 56070
 
4.8%
c 53475
 
4.6%
n 53429
 
4.6%
t 52974
 
4.6%
i 43585
 
3.8%
N 38096
 
3.3%
Other values (33) 527348
45.5%
Common
ValueCountFrequency (%)
; 99494
33.1%
75636
25.2%
. 29283
 
9.7%
) 17743
 
5.9%
( 17743
 
5.9%
5 13889
 
4.6%
+ 13889
 
4.6%
- 9345
 
3.1%
/ 8276
 
2.8%
8 3854
 
1.3%
Other values (3) 11562
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1458791
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 101778
 
7.0%
; 99494
 
6.8%
r 92412
 
6.3%
e 78463
 
5.4%
75636
 
5.2%
o 60447
 
4.1%
T 56070
 
3.8%
c 53475
 
3.7%
n 53429
 
3.7%
t 52974
 
3.6%
Other values (46) 734613
50.4%
ToolsTechHaveWorkedWith
Distinct: 33133
Distinct (%): 42.5%
Missing: 11300
Missing (%): 12.7%
Memory size: 1.4 MiB

Length

Max length: 421
Median length: 209
Mean length: 35.76140157
Min length: 3

Characters and Unicode

Total characters: 2785241
Distinct characters: 52
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 27546
Unique (%): 35.4%

Sample

1st row: Docker;Kubernetes;npm;Pip;Vite;Webpack;Yarn
2nd row: Cargo;Docker;Kubernetes;Make;Nix
3rd row: Homebrew;npm;Vite;Webpack;Yarn
4th row: Docker;npm;Webpack;Yarn
5th row: Docker;Homebrew;Kubernetes;npm;pnpm;Terraform
ValueCountFrequency (%)
build 12109
 
8.0%
studio 11751
 
7.7%
solution 8616
 
5.7%
tool 2418
 
1.6%
3d 1945
 
1.3%
gnu 1677
 
1.1%
gcc;llvm's 1490
 
1.0%
3d;unreal 1416
 
0.9%
maven 1313
 
0.9%
visual 1301
 
0.9%
Other values (25046) 107754
71.0%

Most occurring characters

ValueCountFrequency (%)
; 296449
 
10.6%
e 258761
 
9.3%
o 178398
 
6.4%
a 152047
 
5.5%
r 151440
 
5.4%
n 143787
 
5.2%
i 112233
 
4.0%
p 98436
 
3.5%
t 98115
 
3.5%
l 93202
 
3.3%
Other values (42) 1202373
43.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1876749
67.4%
Uppercase Letter 501235
 
18.0%
Other Punctuation 302750
 
10.9%
Space Separator 73906
 
2.7%
Open Punctuation 12109
 
0.4%
Close Punctuation 12109
 
0.4%
Decimal Number 6383
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 258761
13.8%
o 178398
 
9.5%
a 152047
 
8.1%
r 151440
 
8.1%
n 143787
 
7.7%
i 112233
 
6.0%
p 98436
 
5.2%
t 98115
 
5.2%
l 93202
 
5.0%
m 87125
 
4.6%
Other values (14) 503205
26.8%
Uppercase Letter
ValueCountFrequency (%)
C 66244
13.2%
M 59956
12.0%
G 48896
9.8%
D 47796
9.5%
P 39029
 
7.8%
S 36066
 
7.2%
V 33183
 
6.6%
N 26848
 
5.4%
U 18135
 
3.6%
H 17647
 
3.5%
Other values (10) 107435
21.4%
Other Punctuation
ValueCountFrequency (%)
; 296449
97.9%
' 5959
 
2.0%
. 342
 
0.1%
Decimal Number
ValueCountFrequency (%)
3 5561
87.1%
2 822
 
12.9%
Space Separator
ValueCountFrequency (%)
73906
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12109
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12109
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2377984
85.4%
Common 407257
 
14.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 258761
 
10.9%
o 178398
 
7.5%
a 152047
 
6.4%
r 151440
 
6.4%
n 143787
 
6.0%
i 112233
 
4.7%
p 98436
 
4.1%
t 98115
 
4.1%
l 93202
 
3.9%
m 87125
 
3.7%
Other values (34) 1004440
42.2%
Common
ValueCountFrequency (%)
; 296449
72.8%
73906
 
18.1%
( 12109
 
3.0%
) 12109
 
3.0%
' 5959
 
1.5%
3 5561
 
1.4%
2 822
 
0.2%
. 342
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2785241
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 296449
 
10.6%
e 258761
 
9.3%
o 178398
 
6.4%
a 152047
 
5.5%
r 151440
 
5.4%
n 143787
 
5.2%
i 112233
 
4.0%
p 98436
 
3.5%
t 98115
 
3.5%
l 93202
 
3.3%
Other values (42) 1202373
43.2%

ToolsTechWantToWorkWith
Text

MISSING

Distinct 27456
Distinct (%) 40.2%
Missing 20869
Missing (%) 23.4%
Memory size 1.4 MiB

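The percentages in this summary can be cross-checked against the counts. The sketch below assumes that "Missing (%)" is taken over all 89184 observations and that "Distinct (%)" is taken over the non-missing rows only. Those formulas are an inference from the numbers, not something the report states, but they reproduce the displayed 23.4% and 40.2%.

```python
# Counts copied from the report; the formulas are an assumption that
# happens to reproduce the displayed percentages.
n_rows = 89184      # "Number of observations" in the overview
n_missing = 20869   # "Missing" for this variable
n_distinct = 27456  # "Distinct" for this variable

n_present = n_rows - n_missing                # rows with a value
missing_pct = 100 * n_missing / n_rows        # share of all rows
distinct_pct = 100 * n_distinct / n_present   # share of non-missing rows

print(f"Non-missing rows: {n_present}")
print(f"Missing (%):  {missing_pct:.1f}%")
print(f"Distinct (%): {distinct_pct:.1f}%")
```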
Length

Max length 421
Median length 224
Mean length 32.12057381
Min length 3

Characters and Unicode

Total characters 2194317
Distinct characters 52
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 22439
Unique (%) 32.8%

Sample

1st row Godot;npm;pnpm;Unity 3D;Unreal Engine;Vite;Webpack;Yarn
2nd row Cargo;Kubernetes;Nix
3rd row Homebrew;npm;Vite
4th row Docker;npm;Yarn
5th row Cargo
ValueCountFrequency (%)
studio 8360
 
6.8%
build 6721
 
5.4%
solution 6423
 
5.2%
3d;unreal 2804
 
2.3%
engine 2378
 
1.9%
tool 1794
 
1.5%
3d 1463
 
1.2%
gnu 1372
 
1.1%
docker 1331
 
1.1%
gcc;llvm's 1254
 
1.0%
Other values (21867) 89600
72.6%
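Tokens such as `3d;unreal` and `gcc;llvm's` in the word table above suggest that the values were lowercased and split on whitespace, so the `;` separator between selected options stays inside the tokens. The sketch below contrasts that assumed tokenization with splitting on `;`, using the five sample rows of this variable. The variable names are hypothetical and the whitespace rule is an inference from the table, not a documented behavior.

```python
from collections import Counter

# The five sample rows shown for this variable.
rows = [
    "Godot;npm;pnpm;Unity 3D;Unreal Engine;Vite;Webpack;Yarn",
    "Cargo;Kubernetes;Nix",
    "Homebrew;npm;Vite",
    "Docker;npm;Yarn",
    "Cargo",
]

# Assumed tokenization behind the word table: lowercase, split on whitespace.
whitespace_tokens = Counter(
    token for row in rows for token in row.lower().split()
)

# Alternative: treat each ';'-separated entry as one selected option.
option_counts = Counter(
    option for row in rows for option in row.split(";")
)

print(whitespace_tokens.most_common(3))
print(option_counts.most_common(5))
```

Splitting on `;` keeps multi-word options such as `Unity 3D` intact, which is usually what is wanted when counting how often each tool was selected.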

Most occurring characters

ValueCountFrequency (%)
; 223378
 
10.2%
e 222286
 
10.1%
o 141450
 
6.4%
r 140711
 
6.4%
n 123173
 
5.6%
a 111222
 
5.1%
i 88044
 
4.0%
t 82874
 
3.8%
u 70670
 
3.2%
m 67253
 
3.1%
Other values (42) 923256
42.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1503503
68.5%
Uppercase Letter 386971
 
17.6%
Other Punctuation 228868
 
10.4%
Space Separator 55185
 
2.5%
Close Punctuation 6721
 
0.3%
Open Punctuation 6721
 
0.3%
Decimal Number 6348
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 222286
14.8%
o 141450
 
9.4%
r 140711
 
9.4%
n 123173
 
8.2%
a 111222
 
7.4%
i 88044
 
5.9%
t 82874
 
5.5%
u 70670
 
4.7%
m 67253
 
4.5%
l 66162
 
4.4%
Other values (14) 389658
25.9%
Uppercase Letter
ValueCountFrequency (%)
C 48993
12.7%
D 43193
11.2%
M 37018
9.6%
G 35590
9.2%
P 28531
 
7.4%
V 28388
 
7.3%
S 24136
 
6.2%
N 20320
 
5.3%
K 19424
 
5.0%
U 17958
 
4.6%
Other values (10) 83420
21.6%
Other Punctuation
ValueCountFrequency (%)
; 223378
97.6%
' 5236
 
2.3%
. 254
 
0.1%
Decimal Number
ValueCountFrequency (%)
3 5653
89.1%
2 695
 
10.9%
Space Separator
ValueCountFrequency (%)
55185
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6721
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6721
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1890474
86.2%
Common 303843
 
13.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 222286
 
11.8%
o 141450
 
7.5%
r 140711
 
7.4%
n 123173
 
6.5%
a 111222
 
5.9%
i 88044
 
4.7%
t 82874
 
4.4%
u 70670
 
3.7%
m 67253
 
3.6%
l 66162
 
3.5%
Other values (34) 776629
41.1%
Common
ValueCountFrequency (%)
; 223378
73.5%
55185
 
18.2%
) 6721
 
2.2%
( 6721
 
2.2%
3 5653
 
1.9%
' 5236
 
1.7%
2 695
 
0.2%
. 254
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2194317
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 223378
 
10.2%
e 222286
 
10.1%
o 141450
 
6.4%
r 140711
 
6.4%
n 123173
 
5.6%
a 111222
 
5.1%
i 88044
 
4.0%
t 82874
 
3.8%
u 70670
 
3.2%
m 67253
 
3.1%
Other values (42) 923256
42.1%

NEWCollabToolsHaveWorkedWith
Text

MISSING

Distinct 21262
Distinct (%) 24.8%
Missing 3320
Missing (%) 3.7%
Memory size 1.4 MiB

Length

Max length 370
Median length 243
Mean length 42.39065266
Min length 3

Characters and Unicode

Total characters 3639831
Distinct characters 50
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
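The length and character figures for this variable are consistent with one another if the mean length is the total character count divided by the number of non-missing rows. That relationship is an assumption inferred from the numbers rather than stated in the report. A short check:

```python
# Figures copied from the report for this variable.
n_rows = 89184            # "Number of observations" in the overview
n_missing = 3320          # "Missing"
total_characters = 3639831
reported_mean_length = 42.39065266

# Assumed definition: characters per non-missing value.
n_present = n_rows - n_missing
mean_length = total_characters / n_present

print(f"Non-missing rows: {n_present}")
print(f"Computed mean length: {mean_length:.8f}")
print(f"Matches report: {abs(mean_length - reported_mean_length) < 1e-6}")
```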

Unique

Unique 16694
Unique (%) 19.4%

Sample

1st row Vim;Visual Studio Code
2nd row Emacs;Helix
3rd row IntelliJ IDEA;Vim;Visual Studio Code;WebStorm
4th row Vim;Visual Studio Code
5th row Helix;Neovim
ValueCountFrequency (%)
studio 69571
22.6%
code 51456
16.7%
studio;visual 22108
 
7.2%
android 14553
 
4.7%
visual 14287
 
4.6%
intellij 10038
 
3.3%
text;visual 5699
 
1.9%
code;xcode 5480
 
1.8%
idea;visual 3867
 
1.3%
notepad++;visual 3814
 
1.2%
Other values (8553) 106757
34.7%

Most occurring characters

ValueCountFrequency (%)
i 307253
 
8.4%
o 304858
 
8.4%
d 248961
 
6.8%
u 235318
 
6.5%
t 232585
 
6.4%
e 225381
 
6.2%
221766
 
6.1%
; 215987
 
5.9%
a 166536
 
4.6%
l 166122
 
4.6%
Other values (40) 1315064
36.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2444259
67.2%
Uppercase Letter 690356
 
19.0%
Other Punctuation 233126
 
6.4%
Space Separator 221766
 
6.1%
Math Symbol 46402
 
1.3%
Close Punctuation 1961
 
0.1%
Open Punctuation 1961
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 134731
19.5%
V 111319
16.1%
C 90387
13.1%
N 53325
 
7.7%
I 50644
 
7.3%
J 45257
 
6.6%
A 42636
 
6.2%
E 36554
 
5.3%
D 29566
 
4.3%
P 22158
 
3.2%
Other values (12) 73779
10.7%
Lowercase Letter
ValueCountFrequency (%)
i 307253
12.6%
o 304858
12.5%
d 248961
10.2%
u 235318
9.6%
t 232585
9.5%
e 225381
9.2%
a 166536
6.8%
l 166122
6.8%
s 105858
 
4.3%
r 77983
 
3.2%
Other values (10) 373404
15.3%
Other Punctuation
ValueCountFrequency (%)
; 215987
92.6%
/ 11024
 
4.7%
: 4154
 
1.8%
, 1961
 
0.8%
Space Separator
ValueCountFrequency (%)
221766
100.0%
Math Symbol
ValueCountFrequency (%)
+ 46402
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1961
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1961
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3134615
86.1%
Common 505216
 
13.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 307253
 
9.8%
o 304858
 
9.7%
d 248961
 
7.9%
u 235318
 
7.5%
t 232585
 
7.4%
e 225381
 
7.2%
a 166536
 
5.3%
l 166122
 
5.3%
S 134731
 
4.3%
V 111319
 
3.6%
Other values (32) 1001551
32.0%
Common
ValueCountFrequency (%)
221766
43.9%
; 215987
42.8%
+ 46402
 
9.2%
/ 11024
 
2.2%
: 4154
 
0.8%
) 1961
 
0.4%
, 1961
 
0.4%
( 1961
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3639831
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 307253
 
8.4%
o 304858
 
8.4%
d 248961
 
6.8%
u 235318
 
6.5%
t 232585
 
6.4%
e 225381
 
6.2%
221766
 
6.1%
; 215987
 
5.9%
a 166536
 
4.6%
l 166122
 
4.6%
Other values (40) 1315064
36.1%

NEWCollabToolsWantToWorkWith
Text

MISSING

Distinct 13659
Distinct (%) 17.8%
Missing 12535
Missing (%) 14.1%
Memory size 1.4 MiB

Length

Max length 370
Median length 281
Mean length 33.87779358
Min length 3

Characters and Unicode

Total characters 2596699
Distinct characters 50
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 10071
Unique (%) 13.1%

Sample

1st row Vim;Visual Studio Code
2nd row Emacs;Helix
3rd row IntelliJ IDEA;Vim;WebStorm
4th row Vim;Visual Studio Code
5th row IPython;Neovim;RStudio;Visual Studio Code
ValueCountFrequency (%)
studio 55441
23.0%
code 42590
17.7%
visual 15597
 
6.5%
studio;visual 13881
 
5.8%
intellij 9011
 
3.7%
android 9007
 
3.7%
code;xcode 3478
 
1.4%
notepad++;visual 3209
 
1.3%
jupyter 3150
 
1.3%
idea;visual 3066
 
1.3%
Other values (6135) 82845
34.3%

Most occurring characters

ValueCountFrequency (%)
i 225866
 
8.7%
o 221836
 
8.5%
d 181163
 
7.0%
u 172598
 
6.6%
164626
 
6.3%
t 162432
 
6.3%
e 161703
 
6.2%
; 137311
 
5.3%
a 119706
 
4.6%
l 119122
 
4.6%
Other values (40) 930336
35.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1751238
67.4%
Uppercase Letter 500434
 
19.3%
Space Separator 164626
 
6.3%
Other Punctuation 147977
 
5.7%
Math Symbol 29220
 
1.1%
Close Punctuation 1602
 
0.1%
Open Punctuation 1602
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 96967
19.4%
V 83785
16.7%
C 69541
13.9%
N 37501
 
7.5%
I 36906
 
7.4%
J 33047
 
6.6%
A 27109
 
5.4%
E 23409
 
4.7%
D 22060
 
4.4%
P 15677
 
3.1%
Other values (12) 54432
10.9%
Lowercase Letter
ValueCountFrequency (%)
i 225866
12.9%
o 221836
12.7%
d 181163
10.3%
u 172598
9.9%
t 162432
9.3%
e 161703
9.2%
a 119706
6.8%
l 119122
6.8%
s 73695
 
4.2%
m 56502
 
3.2%
Other values (10) 256615
14.7%
Other Punctuation
ValueCountFrequency (%)
; 137311
92.8%
/ 8080
 
5.5%
, 1602
 
1.1%
: 984
 
0.7%
Space Separator
ValueCountFrequency (%)
164626
100.0%
Math Symbol
ValueCountFrequency (%)
+ 29220
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1602
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1602
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2251672
86.7%
Common 345027
 
13.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 225866
 
10.0%
o 221836
 
9.9%
d 181163
 
8.0%
u 172598
 
7.7%
t 162432
 
7.2%
e 161703
 
7.2%
a 119706
 
5.3%
l 119122
 
5.3%
S 96967
 
4.3%
V 83785
 
3.7%
Other values (32) 706494
31.4%
Common
ValueCountFrequency (%)
164626
47.7%
; 137311
39.8%
+ 29220
 
8.5%
/ 8080
 
2.3%
, 1602
 
0.5%
) 1602
 
0.5%
( 1602
 
0.5%
: 984
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2596699
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 225866
 
8.7%
o 221836
 
8.5%
d 181163
 
7.0%
u 172598
 
6.6%
164626
 
6.3%
t 162432
 
6.3%
e 161703
 
6.2%
; 137311
 
5.3%
a 119706
 
4.6%
l 119122
 
4.6%
Other values (40) 930336
35.8%

OpSysPersonal use
Text

MISSING 

Distinct 3050
Distinct (%) 3.5%
Missing 2627
Missing (%) 2.9%
Memory size 1.4 MiB

Length

Max length 180
Median length 147
Mean length 19.63894312
Min length 3

Characters and Unicode

Total characters 1699888
Distinct characters 44
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1666
Unique (%) 1.9%

Sample

1st row iOS;iPadOS;MacOS;Windows;Windows Subsystem for Linux (WSL)
2nd row MacOS;Other Linux-based
3rd row iOS;iPadOS;MacOS
4th row Other (Please Specify):
5th row Other (Please Specify):
ValueCountFrequency (%)
windows 18014
 
11.4%
subsystem 14710
 
9.3%
for 14710
 
9.3%
linux 14710
 
9.3%
wsl 14409
 
9.1%
macos 10022
 
6.3%
ubuntu 4137
 
2.6%
windows;windows 3480
 
2.2%
ubuntu;windows 3427
 
2.2%
other 3199
 
2.0%
Other values (1889) 57682
36.4%

Most occurring characters

ValueCountFrequency (%)
n 135980
 
8.0%
i 129900
 
7.6%
d 115229
 
6.8%
s 105673
 
6.2%
o 102577
 
6.0%
; 96262
 
5.7%
u 84293
 
5.0%
W 81506
 
4.8%
S 77837
 
4.6%
71943
 
4.2%
Other values (34) 698688
41.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1137075
66.9%
Uppercase Letter 351887
 
20.7%
Other Punctuation 98324
 
5.8%
Space Separator 71943
 
4.2%
Open Punctuation 16772
 
1.0%
Close Punctuation 16772
 
1.0%
Dash Punctuation 7115
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 135980
12.0%
i 129900
11.4%
d 115229
10.1%
s 105673
9.3%
o 102577
9.0%
u 84293
 
7.4%
w 67704
 
6.0%
a 55989
 
4.9%
b 52933
 
4.7%
r 51987
 
4.6%
Other values (12) 234810
20.7%
Uppercase Letter
ValueCountFrequency (%)
W 81506
23.2%
S 77837
22.1%
O 54413
15.5%
L 36535
10.4%
M 28407
 
8.1%
U 23791
 
6.8%
A 22623
 
6.4%
D 8156
 
2.3%
P 7018
 
2.0%
F 3812
 
1.1%
Other values (6) 7789
 
2.2%
Other Punctuation
ValueCountFrequency (%)
; 96262
97.9%
: 2062
 
2.1%
Space Separator
ValueCountFrequency (%)
71943
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16772
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16772
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7115
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1488962
87.6%
Common 210926
 
12.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 135980
 
9.1%
i 129900
 
8.7%
d 115229
 
7.7%
s 105673
 
7.1%
o 102577
 
6.9%
u 84293
 
5.7%
W 81506
 
5.5%
S 77837
 
5.2%
w 67704
 
4.5%
a 55989
 
3.8%
Other values (28) 532274
35.7%
Common
ValueCountFrequency (%)
; 96262
45.6%
71943
34.1%
( 16772
 
8.0%
) 16772
 
8.0%
- 7115
 
3.4%
: 2062
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1699888
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 135980
 
8.0%
i 129900
 
7.6%
d 115229
 
6.8%
s 105673
 
6.2%
o 102577
 
6.0%
; 96262
 
5.7%
u 84293
 
5.0%
W 81506
 
4.8%
S 77837
 
4.6%
71943
 
4.2%
Other values (34) 698688
41.1%

OpSysProfessional use
Text

MISSING 

Distinct 2470
Distinct (%) 3.1%
Missing 10597
Missing (%) 11.9%
Memory size 1.4 MiB

Length

Max length 180
Median length 156
Mean length 18.4622902
Min length 3

Characters and Unicode

Total characters 1450896
Distinct characters 44
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1367
Unique (%) 1.7%

Sample

1st row MacOS;Windows;Windows Subsystem for Linux (WSL)
2nd row MacOS;Other Linux-based
3rd row iOS;iPadOS;MacOS
4th row Other (Please Specify):
5th row MacOS
ValueCountFrequency (%)
windows 16446
 
11.2%
for 13674
 
9.3%
linux 13674
 
9.3%
subsystem 13674
 
9.3%
wsl 13423
 
9.1%
macos 12928
 
8.8%
ubuntu 5201
 
3.5%
windows;windows 3968
 
2.7%
other 3338
 
2.3%
ubuntu;windows 3147
 
2.1%
Other values (1361) 47965
32.5%

Most occurring characters

ValueCountFrequency (%)
n 113297
 
7.8%
i 100955
 
7.0%
s 90669
 
6.2%
d 84776
 
5.8%
u 80696
 
5.6%
o 79347
 
5.5%
; 73074
 
5.0%
68851
 
4.7%
S 68443
 
4.7%
W 68265
 
4.7%
Other values (34) 622523
42.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 963190
66.4%
Uppercase Letter 306621
 
21.1%
Other Punctuation 74774
 
5.2%
Space Separator 68851
 
4.7%
Open Punctuation 15374
 
1.1%
Close Punctuation 15374
 
1.1%
Dash Punctuation 6712
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 113297
11.8%
i 100955
10.5%
s 90669
9.4%
d 84776
8.8%
u 80696
 
8.4%
o 79347
 
8.2%
w 55396
 
5.8%
a 53772
 
5.6%
b 50726
 
5.3%
t 49410
 
5.1%
Other values (12) 204146
21.2%
Uppercase Letter
ValueCountFrequency (%)
S 68443
22.3%
W 68265
22.3%
O 46972
15.3%
L 34060
11.1%
M 28786
9.4%
U 23281
 
7.6%
A 11339
 
3.7%
D 7576
 
2.5%
P 4118
 
1.3%
H 4117
 
1.3%
Other values (6) 9664
 
3.2%
Other Punctuation
ValueCountFrequency (%)
; 73074
97.7%
: 1700
 
2.3%
Space Separator
ValueCountFrequency (%)
68851
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15374
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15374
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6712
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1269811
87.5%
Common 181085
 
12.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 113297
 
8.9%
i 100955
 
8.0%
s 90669
 
7.1%
d 84776
 
6.7%
u 80696
 
6.4%
o 79347
 
6.2%
S 68443
 
5.4%
W 68265
 
5.4%
w 55396
 
4.4%
a 53772
 
4.2%
Other values (28) 474195
37.3%
Common
ValueCountFrequency (%)
; 73074
40.4%
68851
38.0%
( 15374
 
8.5%
) 15374
 
8.5%
- 6712
 
3.7%
: 1700
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1450896
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 113297
 
7.8%
i 100955
 
7.0%
s 90669
 
6.2%
d 84776
 
5.8%
u 80696
 
5.6%
o 79347
 
5.5%
; 73074
 
5.0%
68851
 
4.7%
S 68443
 
4.7%
W 68265
 
4.7%
Other values (34) 622523
42.9%
Distinct 6258
Distinct (%) 9.1%
Missing 20094
Missing (%) 22.5%
Memory size 1.4 MiB

Length

Max length 386
Median length 298
Mean length 22.81542915
Min length 4

Characters and Unicode

Total characters 1576318
Distinct characters 51
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 4000
Unique (%) 5.8%

Sample

1st row Asana;Basecamp;GitHub Discussions;Jira;Linear;Notion;Trello
2nd row Markdown File;Stack Overflow for Teams
3rd row Jira
4th row Confluence;Jira;Notion
5th row Jira;Markdown File;Notion;Stack Overflow for Teams
ValueCountFrequency (%)
azure 10495
 
8.8%
file 8152
 
6.8%
github 6474
 
5.4%
jira 5810
 
4.8%
confluence;jira 5779
 
4.8%
markdown 4559
 
3.8%
confluence;jira;markdown 3342
 
2.8%
discussions 2698
 
2.3%
devops 2502
 
2.1%
discussions;markdown 2453
 
2.0%
Other values (3006) 67565
56.4%

Most occurring characters

ValueCountFrequency (%)
i 139332
 
8.8%
o 136046
 
8.6%
e 123495
 
7.8%
n 109032
 
6.9%
r 107502
 
6.8%
; 104312
 
6.6%
l 79814
 
5.1%
a 79236
 
5.0%
s 76574
 
4.9%
u 65793
 
4.2%
Other values (41) 555182
35.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1182384
75.0%
Uppercase Letter 236251
 
15.0%
Other Punctuation 106199
 
6.7%
Space Separator 50739
 
3.2%
Decimal Number 531
 
< 0.1%
Open Punctuation 107
 
< 0.1%
Close Punctuation 107
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 139332
11.8%
o 136046
11.5%
e 123495
10.4%
n 109032
9.2%
r 107502
9.1%
l 79814
 
6.8%
a 79236
 
6.7%
s 76574
 
6.5%
u 65793
 
5.6%
c 50022
 
4.2%
Other values (15) 215538
18.2%
Uppercase Letter
ValueCountFrequency (%)
J 37642
15.9%
M 33544
14.2%
C 27572
11.7%
D 26358
11.2%
F 18813
8.0%
T 17654
7.5%
A 16700
7.1%
N 12970
 
5.5%
G 12205
 
5.2%
H 12205
 
5.2%
Other values (8) 20588
8.7%
Decimal Number
ValueCountFrequency (%)
3 177
33.3%
6 177
33.3%
0 177
33.3%
Other Punctuation
ValueCountFrequency (%)
; 104312
98.2%
. 1887
 
1.8%
Space Separator
ValueCountFrequency (%)
50739
100.0%
Open Punctuation
ValueCountFrequency (%)
( 107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 107
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1418635
90.0%
Common 157683
 
10.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 139332
 
9.8%
o 136046
 
9.6%
e 123495
 
8.7%
n 109032
 
7.7%
r 107502
 
7.6%
l 79814
 
5.6%
a 79236
 
5.6%
s 76574
 
5.4%
u 65793
 
4.6%
c 50022
 
3.5%
Other values (33) 451789
31.8%
Common
ValueCountFrequency (%)
; 104312
66.2%
50739
32.2%
. 1887
 
1.2%
3 177
 
0.1%
6 177
 
0.1%
0 177
 
0.1%
( 107
 
0.1%
) 107
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1576318
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 139332
 
8.8%
o 136046
 
8.6%
e 123495
 
7.8%
n 109032
 
6.9%
r 107502
 
6.8%
; 104312
 
6.6%
l 79814
 
5.1%
a 79236
 
5.0%
s 76574
 
4.9%
u 65793
 
4.2%
Other values (41) 555182
35.2%
Distinct 3754
Distinct (%) 7.0%
Missing 35441
Missing (%) 39.7%
Memory size 1.4 MiB

Length

Max length 386
Median length 353
Mean length 20.33963493
Min length 4

Characters and Unicode

Total characters 1093113
Distinct characters 51
Distinct categories 7
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 2288
Unique (%) 4.3%

Sample

1st row GitHub Discussions;Linear;Notion;Trello
2nd row Markdown File
3rd row Jira
4th row Confluence;Jira;Notion
5th row Markdown File;Notion
ValueCountFrequency (%)
file 8473
 
8.8%
azure 7895
 
8.2%
github 6920
 
7.2%
markdown 6077
 
6.3%
jira 4467
 
4.6%
confluence;jira 3630
 
3.8%
discussions 2887
 
3.0%
discussions;markdown 2746
 
2.8%
devops 2597
 
2.7%
overflow 2085
 
2.2%
Other values (1889) 48799
50.5%

Most occurring characters

ValueCountFrequency (%)
i 100448
 
9.2%
o 94971
 
8.7%
e 79329
 
7.3%
n 71994
 
6.6%
r 71247
 
6.5%
s 61922
 
5.7%
; 59665
 
5.5%
a 52931
 
4.8%
l 49486
 
4.5%
u 46097
 
4.2%
Other values (41) 405023
37.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 822934
75.3%
Uppercase Letter 166220
 
15.2%
Other Punctuation 60594
 
5.5%
Space Separator 42833
 
3.9%
Decimal Number 384
 
< 0.1%
Open Punctuation 74
 
< 0.1%
Close Punctuation 74
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 100448
12.2%
o 94971
11.5%
e 79329
9.6%
n 71994
8.7%
r 71247
8.7%
s 61922
 
7.5%
a 52931
 
6.4%
l 49486
 
6.0%
u 46097
 
5.6%
c 32708
 
4.0%
Other values (15) 161801
19.7%
Uppercase Letter
ValueCountFrequency (%)
M 23936
14.4%
J 21503
12.9%
D 20876
12.6%
F 15714
9.5%
C 14592
8.8%
A 11153
6.7%
T 11082
6.7%
H 10625
6.4%
G 10625
6.4%
N 10110
6.1%
Other values (8) 16004
9.6%
Decimal Number
ValueCountFrequency (%)
3 128
33.3%
6 128
33.3%
0 128
33.3%
Other Punctuation
ValueCountFrequency (%)
; 59665
98.5%
. 929
 
1.5%
Space Separator
ValueCountFrequency (%)
42833
100.0%
Open Punctuation
ValueCountFrequency (%)
( 74
100.0%
Close Punctuation
ValueCountFrequency (%)
) 74
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 989154
90.5%
Common 103959
 
9.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 100448
 
10.2%
o 94971
 
9.6%
e 79329
 
8.0%
n 71994
 
7.3%
r 71247
 
7.2%
s 61922
 
6.3%
a 52931
 
5.4%
l 49486
 
5.0%
u 46097
 
4.7%
c 32708
 
3.3%
Other values (33) 328021
33.2%
Common
ValueCountFrequency (%)
; 59665
57.4%
42833
41.2%
. 929
 
0.9%
3 128
 
0.1%
6 128
 
0.1%
0 128
 
0.1%
( 74
 
0.1%
) 74
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1093113
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 100448
 
9.2%
o 94971
 
8.7%
e 79329
 
7.3%
n 71994
 
6.6%
r 71247
 
6.5%
s 61922
 
5.7%
; 59665
 
5.5%
a 52931
 
4.8%
l 49486
 
4.5%
u 46097
 
4.2%
Other values (41) 405023
37.1%
Distinct 6925
Distinct (%) 8.3%
Missing 5745
Missing (%) 6.4%
Memory size 1.4 MiB

Length

Max length 212
Median length 153
Mean length 30.32767651
Min length 3

Characters and Unicode

Total characters 2530511
Distinct characters 35
Distinct categories 4
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 4024
Unique (%) 4.8%

Sample

1st row Cisco Webex Teams;Discord;Google Chat;Google Meet;Signal;Skype;Slack;Telegram;Whatsapp;Zoom
2nd row Microsoft Teams;Slack;Zoom
3rd row Discord;Google Meet;Microsoft Teams;Slack;Zoom
4th row Discord;Google Meet;Slack;Zoom
5th row Google Meet;Microsoft Teams;Slack;Zoom
ValueCountFrequency (%)
microsoft 17227
 
9.6%
google 16640
 
9.3%
discord;google 12834
 
7.2%
meet;microsoft 12038
 
6.7%
teams 10709
 
6.0%
chat;google 7460
 
4.2%
discord;microsoft 7307
 
4.1%
cisco 6031
 
3.4%
webex 6031
 
3.4%
teams;slack 4637
 
2.6%
Other values (2829) 77992
43.6%

Most occurring characters

ValueCountFrequency (%)
o 287790
 
11.4%
e 211135
 
8.3%
; 196254
 
7.8%
a 183756
 
7.3%
s 164891
 
6.5%
t 127842
 
5.1%
c 127044
 
5.0%
l 107138
 
4.2%
m 107125
 
4.2%
i 104436
 
4.1%
Other values (25) 913100
36.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1858564
73.4%
Uppercase Letter 380226
 
15.0%
Other Punctuation 196254
 
7.8%
Space Separator 95467
 
3.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 287790
15.5%
e 211135
11.4%
a 183756
9.9%
s 164891
8.9%
t 127842
 
6.9%
c 127044
 
6.8%
l 107138
 
5.8%
m 107125
 
5.8%
i 104436
 
5.6%
r 100487
 
5.4%
Other values (11) 336920
18.1%
Uppercase Letter
ValueCountFrequency (%)
M 80042
21.1%
T 65451
17.2%
S 61685
16.2%
G 39745
10.5%
Z 39194
10.3%
D 33882
8.9%
W 32149
8.5%
C 18040
 
4.7%
R 4393
 
1.2%
J 3029
 
0.8%
Other values (2) 2616
 
0.7%
Other Punctuation
ValueCountFrequency (%)
; 196254
100.0%
Space Separator
ValueCountFrequency (%)
95467
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2238790
88.5%
Common 291721
 
11.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 287790
12.9%
e 211135
 
9.4%
a 183756
 
8.2%
s 164891
 
7.4%
t 127842
 
5.7%
c 127044
 
5.7%
l 107138
 
4.8%
m 107125
 
4.8%
i 104436
 
4.7%
r 100487
 
4.5%
Other values (23) 717146
32.0%
Common
ValueCountFrequency (%)
; 196254
67.3%
95467
32.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2530511
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 287790
 
11.4%
e 211135
 
8.3%
; 196254
 
7.8%
a 183756
 
7.3%
s 164891
 
6.5%
t 127842
 
5.1%
c 127044
 
5.0%
l 107138
 
4.2%
m 107125
 
4.2%
i 104436
 
4.1%
Other values (25) 913100
36.1%
Distinct 4078
Distinct (%) 5.8%
Missing 19408
Missing (%) 21.8%
Memory size 1.4 MiB

Length

Max length 212
Median length 177
Mean length 21.68674043
Min length 3

Characters and Unicode

Total characters 1513214
Distinct characters 35
Distinct categories 4
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 2225
Unique (%) 3.2%

Sample

1st row Discord;Signal;Slack;Zoom
2nd row Slack;Zoom
3rd row Discord;Google Meet;Slack;Zoom
4th row Discord;Google Meet;Slack;Zoom
5th row Discord
ValueCountFrequency (%)
google 12400
 
10.4%
microsoft 11608
 
9.7%
teams 8425
 
7.0%
discord;google 7634
 
6.4%
slack 4903
 
4.1%
discord 4557
 
3.8%
meet;microsoft 4062
 
3.4%
discord;microsoft 4044
 
3.4%
meet;slack 4040
 
3.4%
chat;google 3890
 
3.2%
Other values (2326) 54139
45.2%

Most occurring characters

ValueCountFrequency (%)
o 160946
 
10.6%
e 121198
 
8.0%
a 116278
 
7.7%
; 106385
 
7.0%
s 93576
 
6.2%
c 83384
 
5.5%
l 76334
 
5.0%
t 76308
 
5.0%
i 68589
 
4.5%
r 66526
 
4.4%
Other values (25) 543690
35.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1126044
74.4%
Uppercase Letter 230859
 
15.3%
Other Punctuation 106385
 
7.0%
Space Separator 49926
 
3.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 160946
14.3%
e 121198
10.8%
a 116278
10.3%
s 93576
8.3%
c 83384
7.4%
l 76334
 
6.8%
t 76308
 
6.8%
i 68589
 
6.1%
r 66526
 
5.9%
m 55388
 
4.9%
Other values (11) 207517
18.4%
Uppercase Letter
ValueCountFrequency (%)
M 47401
20.5%
S 43424
18.8%
T 34725
15.0%
D 26837
11.6%
G 24530
10.6%
Z 19269
8.3%
W 17357
 
7.5%
C 9211
 
4.0%
R 3413
 
1.5%
I 2386
 
1.0%
Other values (2) 2306
 
1.0%
Other Punctuation
ValueCountFrequency (%)
; 106385
100.0%
Space Separator
ValueCountFrequency (%)
49926
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1356903
89.7%
Common 156311
 
10.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 160946
 
11.9%
e 121198
 
8.9%
a 116278
 
8.6%
s 93576
 
6.9%
c 83384
 
6.1%
l 76334
 
5.6%
t 76308
 
5.6%
i 68589
 
5.1%
r 66526
 
4.9%
m 55388
 
4.1%
Other values (23) 438376
32.3%
Common
ValueCountFrequency (%)
; 106385
68.1%
49926
31.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1513214
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 160946
 
10.6%
e 121198
 
8.0%
a 116278
 
7.7%
; 106385
 
7.0%
s 93576
 
6.2%
c 83384
 
5.5%
l 76334
 
5.0%
t 76308
 
5.0%
i 68589
 
4.5%
r 66526
 
4.4%
Other values (25) 543690
35.9%
Distinct 323
Distinct (%) 0.6%
Missing 32856
Missing (%) 36.8%
Memory size 1.4 MiB

Length

Max length 104
Median length 7
Mean length 12.72198551
Min length 4

Characters and Unicode

Total characters 716604
Distinct characters 34
Distinct categories 4
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 132
Unique (%) 0.2%

Sample

1st row ChatGPT
2nd row ChatGPT
3rd row ChatGPT;Google Bard AI;Neeva AI
4th row ChatGPT
5th row Bing AI;ChatGPT;Google Bard AI
ValueCountFrequency (%)
chatgpt 32150
38.5%
bing 12884
15.4%
ai;chatgpt 6545
 
7.8%
bard 6217
 
7.5%
ai 5913
 
7.1%
chatgpt;wolframalpha 3855
 
4.6%
chatgpt;google 2893
 
3.5%
ai;chatgpt;google 2858
 
3.4%
wolframalpha 1669
 
2.0%
ai;chatgpt;wolframalpha 1172
 
1.4%
Other values (84) 7258
 
8.7%

Most occurring characters

ValueCountFrequency (%)
a 76575
 
10.7%
h 63074
 
8.8%
G 58679
 
8.2%
P 55911
 
7.8%
t 53327
 
7.4%
C 52462
 
7.3%
T 52462
 
7.3%
; 29409
 
4.1%
A 28838
 
4.0%
27086
 
3.8%
Other values (24) 218781
30.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 359654
50.2%
Uppercase Letter 298854
41.7%
Other Punctuation 31010
 
4.3%
Space Separator 27086
 
3.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 76575
21.3%
h 63074
17.5%
t 53327
14.8%
o 25467
 
7.1%
l 23794
 
6.6%
g 19198
 
5.3%
r 16144
 
4.5%
i 15980
 
4.4%
n 15241
 
4.2%
m 10020
 
2.8%
Other values (9) 40834
11.4%
Uppercase Letter
ValueCountFrequency (%)
G 58679
19.6%
P 55911
18.7%
C 52462
17.6%
T 52462
17.6%
A 28838
9.6%
I 20226
 
6.8%
B 19198
 
6.4%
W 8419
 
2.8%
Y 1601
 
0.5%
Q 643
 
0.2%
Other values (2) 415
 
0.1%
Other Punctuation
ValueCountFrequency (%)
; 29409
94.8%
. 1601
 
5.2%
Space Separator
ValueCountFrequency (%)
27086
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 658508
91.9%
Common 58096
 
8.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 76575
11.6%
h 63074
 
9.6%
G 58679
 
8.9%
P 55911
 
8.5%
t 53327
 
8.1%
C 52462
 
8.0%
T 52462
 
8.0%
A 28838
 
4.4%
o 25467
 
3.9%
l 23794
 
3.6%
Other values (21) 167919
25.5%
Common
ValueCountFrequency (%)
; 29409
50.6%
27086
46.6%
. 1601
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 716604
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 76575
 
10.7%
h 63074
 
8.8%
G 58679
 
8.2%
P 55911
 
7.8%
t 53327
 
7.4%
C 52462
 
7.3%
T 52462
 
7.3%
; 29409
 
4.1%
A 28838
 
4.0%
27086
 
3.8%
Other values (24) 218781
30.5%
Distinct 399
Distinct (%) 0.9%
Missing 43034
Missing (%) 48.3%
Memory size 1.4 MiB

Length

Max length 104
Median length 99
Mean length 15.51014085
Min length 4

Characters and Unicode

Total characters 715793
Distinct characters 34
Distinct categories 4
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 183
Unique (%) 0.4%

Sample

1st row ChatGPT;Neeva AI
2nd row ChatGPT
3rd row ChatGPT;Neeva AI;Perplexity AI
4th row Bing AI;ChatGPT;Google Bard AI
5th row ChatGPT
ValueCountFrequency (%)
chatgpt 21609
25.5%
bard 12243
14.4%
bing 11660
13.8%
ai 10238
12.1%
chatgpt;google 5568
 
6.6%
ai;chatgpt;google 5074
 
6.0%
ai;chatgpt 4220
 
5.0%
chatgpt;wolframalpha 2573
 
3.0%
ai;wolframalpha 1852
 
2.2%
wolframalpha 1339
 
1.6%
Other values (93) 8406
 
9.9%

Most occurring characters

ValueCountFrequency (%)
a 70179
 
9.8%
G 53948
 
7.5%
h 51597
 
7.2%
P 45611
 
6.4%
t 43098
 
6.0%
C 41705
 
5.8%
T 41705
 
5.8%
38632
 
5.4%
o 36221
 
5.1%
; 33490
 
4.7%
Other values (24) 259607
36.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 366096
51.1%
Uppercase Letter 276311
38.6%
Space Separator 38632
 
5.4%
Other Punctuation 34754
 
4.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 70179
19.2%
h 51597
14.1%
t 43098
11.8%
o 36221
9.9%
l 27784
 
7.6%
g 24179
 
6.6%
r 21671
 
5.9%
e 16335
 
4.5%
i 15482
 
4.2%
d 14818
 
4.0%
Other values (9) 44732
12.2%
Uppercase Letter
ValueCountFrequency (%)
G 53948
19.5%
P 45611
16.5%
C 41705
15.1%
T 41705
15.1%
A 33314
12.1%
I 25639
9.3%
B 24179
8.8%
W 7285
 
2.6%
Y 1264
 
0.5%
Q 750
 
0.3%
Other values (2) 911
 
0.3%
Other Punctuation
ValueCountFrequency (%)
; 33490
96.4%
. 1264
 
3.6%
Space Separator
ValueCountFrequency (%)
38632
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 642407
89.7%
Common 73386
 
10.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 70179
 
10.9%
G 53948
 
8.4%
h 51597
 
8.0%
P 45611
 
7.1%
t 43098
 
6.7%
C 41705
 
6.5%
T 41705
 
6.5%
o 36221
 
5.6%
A 33314
 
5.2%
l 27784
 
4.3%
Other values (21) 197245
30.7%
Common
ValueCountFrequency (%)
38632
52.6%
; 33490
45.6%
. 1264
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 715793
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 70179
 
9.8%
G 53948
 
7.5%
h 51597
 
7.2%
P 45611
 
6.4%
t 43098
 
6.0%
C 41705
 
5.8%
T 41705
 
5.8%
38632
 
5.4%
o 36221
 
5.1%
; 33490
 
4.7%
Other values (24) 259607
36.3%

AIDevHaveWorkedWith
Text

MISSING 

Distinct166
Distinct (%)0.6%
Missing63280
Missing (%)71.0%
Memory size1.4 MiB
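The percentages in this block can be checked by hand. The sketch below assumes that Missing (%) is taken over all observations and Distinct (%) over the non-missing values only; those denominators are inferred from the figures, not stated by the report. The observation count comes from the report overview.

```python
# Figures copied from the report; the choice of denominators is an assumption.
n_observations = 89184  # "Number of observations" in the overview
missing = 63280         # "Missing" for AIDevHaveWorkedWith
distinct = 166          # "Distinct" for AIDevHaveWorkedWith

non_missing = n_observations - missing
missing_pct = 100 * missing / n_observations
distinct_pct = 100 * distinct / non_missing

print(f"Non-missing values: {non_missing}")
print(f"Missing (%): {missing_pct:.1f}%")
print(f"Distinct (%): {distinct_pct:.1f}%")
```

Dividing the distinct count by all observations instead would give about 0.2%, which does not match the reported 0.6%, so the non-missing denominator appears to be the one in use.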

Length

Max length122
Median length14
Mean length15.84670321
Min length7

Characters and Unicode

Total characters410493
Distinct characters34
Distinct categories4
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85
Unique (%)0.3%

Sample

1st rowGitHub Copilot
2nd rowGitHub Copilot;Tabnine
3rd rowGitHub Copilot
4th rowGitHub Copilot
5th rowAWS CodeWhisperer;GitHub Copilot;Tabnine
ValueCountFrequency (%)
github 20611
40.0%
copilot 18733
36.4%
copilot;tabnine 2539
 
4.9%
tabnine 2160
 
4.2%
aws 2010
 
3.9%
codewhisperer;github 1158
 
2.2%
codewhisperer 678
 
1.3%
ai 455
 
0.9%
code 384
 
0.7%
copilot;synk 236
 
0.5%
Other values (74) 2568
 
5.0%
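The word table above appears to split values on whitespace and lowercase them, which is why multi-select answers surface as fused tokens such as `copilot;tabnine`. A minimal sketch of counting each selected tool separately, assuming the column joins selections with `;` as the Sample rows suggest; `count_choices` is a hypothetical helper, not part of the report.

```python
from collections import Counter


def count_choices(values, sep=";"):
    """Count individual selections in a delimiter-joined multi-select column."""
    counts = Counter()
    for value in values:
        if not value:  # skip missing or empty cells
            continue
        counts.update(part.strip() for part in value.split(sep) if part.strip())
    return counts


# The five Sample rows shown for AIDevHaveWorkedWith.
sample = [
    "GitHub Copilot",
    "GitHub Copilot;Tabnine",
    "GitHub Copilot",
    "GitHub Copilot",
    "AWS CodeWhisperer;GitHub Copilot;Tabnine",
]
choices = count_choices(sample)
print(choices.most_common())
```

Splitting on the delimiter keeps multi-word names such as "GitHub Copilot" intact, which whitespace tokenization cannot do.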

Most occurring characters

ValueCountFrequency (%)
i 53645
13.1%
o 47604
11.6%
t 45372
11.1%
b 27573
 
6.7%
25628
 
6.2%
C 25191
 
6.1%
p 24939
 
6.1%
u 22884
 
5.6%
l 22798
 
5.6%
G 22413
 
5.5%
Other values (24) 92446
22.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 292522
71.3%
Uppercase Letter 86386
 
21.0%
Space Separator 25628
 
6.2%
Other Punctuation 5957
 
1.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 53645
18.3%
o 47604
16.3%
t 45372
15.5%
b 27573
9.4%
p 24939
8.5%
u 22884
7.8%
l 22798
7.8%
e 13617
 
4.7%
n 11483
 
3.9%
r 5592
 
1.9%
Other values (10) 17015
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
C 25191
29.2%
G 22413
25.9%
H 22078
25.6%
T 5193
 
6.0%
W 4597
 
5.3%
A 2851
 
3.3%
S 2609
 
3.0%
I 606
 
0.7%
R 486
 
0.6%
M 211
 
0.2%
Other Punctuation
ValueCountFrequency (%)
; 5806
97.5%
. 151
 
2.5%
Space Separator
ValueCountFrequency (%)
25628
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 378908
92.3%
Common 31585
 
7.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 53645
14.2%
o 47604
12.6%
t 45372
12.0%
b 27573
7.3%
C 25191
 
6.6%
p 24939
 
6.6%
u 22884
 
6.0%
l 22798
 
6.0%
G 22413
 
5.9%
H 22078
 
5.8%
Other values (21) 64411
17.0%
Common
ValueCountFrequency (%)
25628
81.1%
; 5806
 
18.4%
. 151
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 410493
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 53645
13.1%
o 47604
11.6%
t 45372
11.1%
b 27573
 
6.7%
25628
 
6.2%
C 25191
 
6.1%
p 24939
 
6.1%
u 22884
 
5.6%
l 22798
 
5.6%
G 22413
 
5.5%
Other values (24) 92446
22.5%

AIDevWantToWorkWith
Text

MISSING 

Distinct233
Distinct (%)1.2%
Missing69597
Missing (%)78.0%
Memory size1.4 MiB

Length

Max length122
Median length14
Mean length17.94598458
Min length7

Characters and Unicode

Total characters351508
Distinct characters34
Distinct categories4
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100
Unique (%)0.5%

Sample

1st rowGitHub Copilot
2nd rowGitHub Copilot
3rd rowAWS CodeWhisperer;GitHub Copilot
4th rowGitHub Copilot
5th rowGitHub Copilot
ValueCountFrequency (%)
copilot 15065
35.7%
github 14902
35.3%
aws 2890
 
6.8%
codewhisperer;github 2072
 
4.9%
copilot;tabnine 966
 
2.3%
ai 821
 
1.9%
tabnine 804
 
1.9%
codewhisperer 660
 
1.6%
copilot;whispr 347
 
0.8%
copilot;replit 338
 
0.8%
Other values (76) 3337
 
7.9%

Most occurring characters

ValueCountFrequency (%)
i 43111
12.3%
o 39272
11.2%
t 36523
 
10.4%
22615
 
6.4%
p 21725
 
6.2%
C 21414
 
6.1%
b 20242
 
5.8%
u 18561
 
5.3%
l 18382
 
5.2%
G 17858
 
5.1%
Other values (24) 91805
26.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 245739
69.9%
Uppercase Letter 76490
 
21.8%
Space Separator 22615
 
6.4%
Other Punctuation 6664
 
1.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 43111
17.5%
o 39272
16.0%
t 36523
14.9%
p 21725
8.8%
b 20242
8.2%
u 18561
7.6%
l 18382
7.5%
e 14429
 
5.9%
r 8653
 
3.5%
n 5756
 
2.3%
Other values (10) 19085
7.8%
Uppercase Letter
ValueCountFrequency (%)
C 21414
28.0%
G 17858
23.3%
H 17304
22.6%
W 6913
 
9.0%
A 4499
 
5.9%
S 3575
 
4.7%
T 2216
 
2.9%
I 1182
 
1.5%
R 915
 
1.2%
D 361
 
0.5%
Other Punctuation
ValueCountFrequency (%)
; 6303
94.6%
. 361
 
5.4%
Space Separator
ValueCountFrequency (%)
22615
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 322229
91.7%
Common 29279
 
8.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 43111
13.4%
o 39272
12.2%
t 36523
11.3%
p 21725
 
6.7%
C 21414
 
6.6%
b 20242
 
6.3%
u 18561
 
5.8%
l 18382
 
5.7%
G 17858
 
5.5%
H 17304
 
5.4%
Other values (21) 67837
21.1%
Common
ValueCountFrequency (%)
22615
77.2%
; 6303
 
21.5%
. 361
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 351508
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 43111
12.3%
o 39272
11.2%
t 36523
 
10.4%
22615
 
6.4%
p 21725
 
6.2%
C 21414
 
6.1%
b 20242
 
5.8%
u 18561
 
5.3%
l 18382
 
5.2%
G 17858
 
5.1%
Other values (24) 91805
26.1%

NEWSOSites
Text

MISSING 

Distinct16
Distinct (%)< 0.1%
Missing1211
Missing (%)1.4%
Memory size1.4 MiB

Length

Max length151
Median length29
Mean length31.24398395
Min length14

Characters and Unicode

Total characters2748627
Distinct characters32
Distinct categories6
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowStack Overflow;Stack Exchange
2nd rowStack Overflow;Stack Exchange;Stack Overflow for Teams (private knowledge sharing & collaboration platform for companies)
3rd rowStack Overflow;Stack Exchange
4th rowStack Overflow;Stack Exchange
5th rowStack Overflow
ValueCountFrequency (%)
stack 95090
30.8%
overflow;stack 59588
19.3%
exchange 52140
16.9%
overflow 37705
 
12.2%
for 8794
 
2.8%
on 7091
 
2.3%
knowledge 4397
 
1.4%
platform 4397
 
1.4%
4397
 
1.4%
sharing 4397
 
1.4%
Other values (16) 30995
 
10.0%

Most occurring characters

ValueCountFrequency (%)
a 249008
 
9.1%
c 233503
 
8.5%
221018
 
8.0%
e 197811
 
7.2%
t 180093
 
6.6%
k 162986
 
5.9%
S 157978
 
5.7%
o 148918
 
5.4%
l 130108
 
4.7%
r 126553
 
4.6%
Other values (22) 940651
34.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2116969
77.0%
Uppercase Letter 328055
 
11.9%
Space Separator 221018
 
8.0%
Other Punctuation 73791
 
2.7%
Open Punctuation 4397
 
0.2%
Close Punctuation 4397
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 249008
11.8%
c 233503
11.0%
e 197811
9.3%
t 180093
 
8.5%
k 162986
 
7.7%
o 148918
 
7.0%
l 130108
 
6.1%
r 126553
 
6.0%
v 111659
 
5.3%
f 111529
 
5.3%
Other values (11) 464801
22.0%
Uppercase Letter
ValueCountFrequency (%)
S 157978
48.2%
O 98338
30.0%
E 59640
 
18.2%
C 7091
 
2.2%
T 4397
 
1.3%
I 611
 
0.2%
Other Punctuation
ValueCountFrequency (%)
; 69394
94.0%
& 4397
 
6.0%
Space Separator
ValueCountFrequency (%)
221018
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4397
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4397
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2445024
89.0%
Common 303603
 
11.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 249008
 
10.2%
c 233503
 
9.6%
e 197811
 
8.1%
t 180093
 
7.4%
k 162986
 
6.7%
S 157978
 
6.5%
o 148918
 
6.1%
l 130108
 
5.3%
r 126553
 
5.2%
v 111659
 
4.6%
Other values (17) 746407
30.5%
Common
ValueCountFrequency (%)
221018
72.8%
; 69394
 
22.9%
( 4397
 
1.4%
& 4397
 
1.4%
) 4397
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2748627
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 249008
 
9.1%
c 233503
 
8.5%
221018
 
8.0%
e 197811
 
7.2%
t 180093
 
6.6%
k 162986
 
5.9%
S 157978
 
5.7%
o 148918
 
5.4%
l 130108
 
4.7%
r 126553
 
4.6%
Other values (22) 940651
34.2%

SOVisitFreq
Text

MISSING 

Distinct5
Distinct (%)< 0.1%
Missing2044
Missing (%)2.3%
Memory size1.4 MiB

Length

Max length35
Median length31
Mean length23.89562773
Min length20

Characters and Unicode

Total characters2082265
Distinct characters24
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowDaily or almost daily
2nd rowA few times per month or weekly
3rd rowA few times per week
4th rowA few times per week
5th rowDaily or almost daily
ValueCountFrequency (%)
per 65016
14.4%
times 60349
13.4%
a 48397
10.7%
few 48397
10.7%
or 47103
10.4%
daily 44248
9.8%
week 28085
6.2%
month 24979
 
5.5%
almost 22124
 
4.9%
weekly 20312
 
4.5%
Other values (6) 42572
9.4%

Most occurring characters

ValueCountFrequency (%)
364442
17.5%
e 291842
14.0%
t 128738
 
6.2%
i 116549
 
5.6%
l 115255
 
5.5%
m 112119
 
5.4%
r 112119
 
5.4%
o 103540
 
5.0%
w 96794
 
4.6%
s 91807
 
4.4%
Other values (14) 549060
26.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1630683
78.3%
Space Separator 364442
 
17.5%
Uppercase Letter 87140
 
4.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 291842
17.9%
t 128738
 
7.9%
i 116549
 
7.1%
l 115255
 
7.1%
m 112119
 
6.9%
r 112119
 
6.9%
o 103540
 
6.3%
w 96794
 
5.9%
s 91807
 
5.6%
a 82991
 
5.1%
Other values (9) 378929
23.2%
Uppercase Letter
ValueCountFrequency (%)
A 48397
55.5%
D 22124
25.4%
M 11952
 
13.7%
L 4667
 
5.4%
Space Separator
ValueCountFrequency (%)
364442
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1717823
82.5%
Common 364442
 
17.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 291842
17.0%
t 128738
 
7.5%
i 116549
 
6.8%
l 115255
 
6.7%
m 112119
 
6.5%
r 112119
 
6.5%
o 103540
 
6.0%
w 96794
 
5.6%
s 91807
 
5.3%
a 82991
 
4.8%
Other values (13) 466069
27.1%
Common
ValueCountFrequency (%)
364442
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2082265
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
364442
17.5%
e 291842
14.0%
t 128738
 
6.2%
i 116549
 
5.6%
l 115255
 
5.5%
m 112119
 
5.4%
r 112119
 
5.4%
o 103540
 
5.0%
w 96794
 
4.6%
s 91807
 
4.4%
Other values (14) 549060
26.4%

SOAccount
Text

MISSING 

Distinct3
Distinct (%)< 0.1%
Missing1332
Missing (%)1.5%
Memory size1.4 MiB

Length

Max length23
Median length3
Mean length4.416268269
Min length2

Characters and Unicode

Total characters387978
Distinct characters16
Distinct categories4
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowYes
2nd rowYes
3rd rowYes
4th rowNo
5th rowYes
ValueCountFrequency (%)
yes 66282
65.1%
no 14618
 
14.4%
not 6952
 
6.8%
sure/can't 6952
 
6.8%
remember 6952
 
6.8%

Most occurring characters

ValueCountFrequency (%)
e 94090
24.3%
s 73234
18.9%
Y 66282
17.1%
N 21570
 
5.6%
o 21570
 
5.6%
r 20856
 
5.4%
t 13904
 
3.6%
13904
 
3.6%
m 13904
 
3.6%
u 6952
 
1.8%
Other values (6) 41712
10.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 272318
70.2%
Uppercase Letter 87852
 
22.6%
Space Separator 13904
 
3.6%
Other Punctuation 13904
 
3.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 94090
34.6%
s 73234
26.9%
o 21570
 
7.9%
r 20856
 
7.7%
t 13904
 
5.1%
m 13904
 
5.1%
u 6952
 
2.6%
c 6952
 
2.6%
a 6952
 
2.6%
n 6952
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
Y 66282
75.4%
N 21570
 
24.6%
Other Punctuation
ValueCountFrequency (%)
/ 6952
50.0%
' 6952
50.0%
Space Separator
ValueCountFrequency (%)
13904
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 360170
92.8%
Common 27808
 
7.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 94090
26.1%
s 73234
20.3%
Y 66282
18.4%
N 21570
 
6.0%
o 21570
 
6.0%
r 20856
 
5.8%
t 13904
 
3.9%
m 13904
 
3.9%
u 6952
 
1.9%
c 6952
 
1.9%
Other values (3) 20856
 
5.8%
Common
ValueCountFrequency (%)
13904
50.0%
/ 6952
25.0%
' 6952
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 387978
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 94090
24.3%
s 73234
18.9%
Y 66282
17.1%
N 21570
 
5.6%
o 21570
 
5.6%
r 20856
 
5.4%
t 13904
 
3.6%
13904
 
3.6%
m 13904
 
3.6%
u 6952
 
1.8%
Other values (6) 41712
10.8%

SOPartFreq
Text

MISSING 

Distinct6
Distinct (%)< 0.1%
Missing23123
Missing (%)25.9%
Memory size1.4 MiB

Length

Max length50
Median length35
Mean length37.13846294
Min length20

Characters and Unicode

Total characters2453404
Distinct characters30
Distinct categories4
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowA few times per month or weekly
2nd rowLess than once per month or monthly
3rd rowLess than once per month or monthly
4th rowI have never participated in Q&A on Stack Overflow
5th rowLess than once per month or monthly
ValueCountFrequency (%)
per 47791
 
9.9%
or 45130
 
9.3%
month 43821
 
9.1%
less 34661
 
7.2%
once 34661
 
7.2%
monthly 34661
 
7.2%
than 34661
 
7.2%
in 16961
 
3.5%
overflow 16961
 
3.5%
stack 16961
 
3.5%
Other values (15) 157528
32.6%

Most occurring characters

ValueCountFrequency (%)
417736
17.0%
e 253068
 
10.3%
n 198687
 
8.1%
o 193504
 
7.9%
t 179150
 
7.3%
r 143804
 
5.9%
h 130104
 
5.3%
a 107117
 
4.4%
m 92921
 
3.8%
s 83761
 
3.4%
Other values (20) 653552
26.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1884802
76.8%
Space Separator 417736
 
17.0%
Uppercase Letter 133905
 
5.5%
Other Punctuation 16961
 
0.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 253068
13.4%
n 198687
10.5%
o 193504
10.3%
t 179150
9.5%
r 143804
 
7.6%
h 130104
 
6.9%
a 107117
 
5.7%
m 92921
 
4.9%
s 83761
 
4.4%
p 82398
 
4.4%
Other values (10) 420288
22.3%
Uppercase Letter
ValueCountFrequency (%)
L 34661
25.9%
A 29406
22.0%
O 16961
12.7%
I 16961
12.7%
S 16961
12.7%
Q 16961
12.7%
D 1309
 
1.0%
M 685
 
0.5%
Space Separator
ValueCountFrequency (%)
417736
100.0%
Other Punctuation
ValueCountFrequency (%)
& 16961
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2018707
82.3%
Common 434697
 
17.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 253068
12.5%
n 198687
 
9.8%
o 193504
 
9.6%
t 179150
 
8.9%
r 143804
 
7.1%
h 130104
 
6.4%
a 107117
 
5.3%
m 92921
 
4.6%
s 83761
 
4.1%
p 82398
 
4.1%
Other values (18) 554193
27.5%
Common
ValueCountFrequency (%)
417736
96.1%
& 16961
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2453404
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
417736
17.0%
e 253068
 
10.3%
n 198687
 
8.1%
o 193504
 
7.9%
t 179150
 
7.3%
r 143804
 
5.9%
h 130104
 
5.3%
a 107117
 
4.4%
m 92921
 
3.8%
s 83761
 
3.4%
Other values (20) 653552
26.6%

SOComm
Text

MISSING 

Distinct6
Distinct (%)< 0.1%
Missing1492
Missing (%)1.7%
Memory size1.4 MiB

Length

Max length15
Median length14
Mean length12.29066505
Min length7

Characters and Unicode

Total characters1077793
Distinct characters20
Distinct categories4
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowYes, definitely
2nd rowNeutral
3rd rowNo, not really
4th rowNeutral
5th rowNo, not at all
ValueCountFrequency (%)
not 41637
20.0%
no 40698
19.5%
really 29100
13.9%
yes 27022
13.0%
neutral 19033
9.1%
somewhat 19026
9.1%
at 11598
 
5.6%
all 11598
 
5.6%
definitely 7996
 
3.8%
sure 939
 
0.5%

Most occurring characters

ValueCountFrequency (%)
120955
11.2%
e 111112
10.3%
l 108425
10.1%
o 101361
9.4%
t 99290
9.2%
a 90355
8.4%
, 67720
 
6.3%
N 60670
 
5.6%
r 49072
 
4.6%
n 48694
 
4.5%
Other values (10) 220139
20.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 801426
74.4%
Space Separator 120955
 
11.2%
Uppercase Letter 87692
 
8.1%
Other Punctuation 67720
 
6.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 111112
13.9%
l 108425
13.5%
o 101361
12.6%
t 99290
12.4%
a 90355
11.3%
r 49072
6.1%
n 48694
6.1%
s 46987
5.9%
y 37096
 
4.6%
u 19972
 
2.5%
Other values (6) 89062
11.1%
Uppercase Letter
ValueCountFrequency (%)
N 60670
69.2%
Y 27022
30.8%
Space Separator
ValueCountFrequency (%)
120955
100.0%
Other Punctuation
ValueCountFrequency (%)
, 67720
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 889118
82.5%
Common 188675
 
17.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 111112
12.5%
l 108425
12.2%
o 101361
11.4%
t 99290
11.2%
a 90355
10.2%
N 60670
6.8%
r 49072
 
5.5%
n 48694
 
5.5%
s 46987
 
5.3%
y 37096
 
4.2%
Other values (8) 136056
15.3%
Common
ValueCountFrequency (%)
120955
64.1%
, 67720
35.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1077793
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
120955
11.2%
e 111112
10.3%
l 108425
10.1%
o 101361
9.4%
t 99290
9.2%
a 90355
8.4%
, 67720
 
6.3%
N 60670
 
5.6%
r 49072
 
4.6%
n 48694
 
4.5%
Other values (10) 220139
20.4%

SOAI
Text

MISSING 

Distinct43056
Distinct (%)90.0%
Missing41326
Missing (%)46.3%
Memory size1.4 MiB

Length

Max length3049
Median length1155
Mean length102.5597183
Min length1

Characters and Unicode

Total characters4908303
Distinct characters510
Distinct categories20
Distinct scripts13
Distinct blocks19
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
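The Length and Characters figures for this variable are consistent with one another: the mean length matches the total character count divided by the number of non-missing answers. The short check below assumes that relationship (it is inferred from the numbers, not documented in the report) and takes the observation count from the report overview.

```python
# Figures copied from the report for the SOAI variable.
n_observations = 89184      # "Number of observations" in the overview
missing = 41326             # "Missing"
total_characters = 4908303  # "Total characters"
reported_mean = 102.5597183  # "Mean length"

non_missing = n_observations - missing
mean_length = total_characters / non_missing

print(f"Non-missing answers: {non_missing}")
print(f"Recomputed mean length: {mean_length:.7f}")
print(f"Reported mean length:   {reported_mean:.7f}")
```

The same division can be used as a quick sanity check on any other text variable in the report.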

Unique

Unique42238
Unique (%)88.3%

Sample

1st rowI don't think it's super necessary, but I think improving search and clarifying poorly worded questions would be useful (especially for moderators/editors).
2nd rowI'm wearing of Stack Overflow using AI.
3rd rowUsing AI to suggest better answer to my questions.
4th rowNeutral
5th rowTo suggest solutions in alternative language for example
ValueCountFrequency (%)
to 31428
 
3.7%
the 27845
 
3.3%
ai 24613
 
2.9%
and 18756
 
2.2%
it 18347
 
2.2%
i 16670
 
2.0%
a 16149
 
1.9%
be 14890
 
1.8%
questions 13475
 
1.6%
of 12096
 
1.4%
Other values (19163) 650498
77.0%
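For a free-text column like this one, a word table of the kind shown above can be approximated by lowercasing each answer and splitting it on whitespace. The sketch below also strips surrounding punctuation from each token, which is an assumption; the report does not say how it tokenizes. `word_counts` is a hypothetical helper and the input reuses three of the Sample rows.

```python
import string
from collections import Counter


def word_counts(values):
    """Count lowercased, whitespace-separated words, ignoring edge punctuation."""
    counts = Counter()
    for value in values:
        if not value:  # skip missing or empty answers
            continue
        for token in value.lower().split():
            word = token.strip(string.punctuation)
            if word:
                counts[word] += 1
    return counts


sample = [
    "Using AI to suggest better answer to my questions.",
    "Neutral",
    "To suggest solutions in alternative language for example",
]
words = word_counts(sample)
print(words.most_common(3))
```

As in the reported table, short function words such as "to" dominate, so a stop-word filter would be a natural next step before interpreting the counts.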

Most occurring characters

ValueCountFrequency (%)
798791
16.3%
e 480081
 
9.8%
t 370357
 
7.5%
o 322144
 
6.6%
s 290297
 
5.9%
n 279124
 
5.7%
i 275200
 
5.6%
a 268785
 
5.5%
r 221407
 
4.5%
l 168687
 
3.4%
Other values (500) 1433430
29.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3857190
78.6%
Space Separator 798798
 
16.3%
Uppercase Letter 141250
 
2.9%
Other Punctuation 96263
 
2.0%
Dash Punctuation 4333
 
0.1%
Close Punctuation 2946
 
0.1%
Open Punctuation 2674
 
0.1%
Decimal Number 1955
 
< 0.1%
Other Letter 1123
 
< 0.1%
Final Punctuation 1080
 
< 0.1%
Other values (10) 691
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
ا 98
 
8.7%
ل 69
 
6.1%
م 61
 
5.4%
ت 49
 
4.4%
ي 48
 
4.3%
و 39
 
3.5%
ه 26
 
2.3%
ن 25
 
2.2%
ع 24
 
2.1%
ف 22
 
2.0%
Other values (283) 662
58.9%
Lowercase Letter
ValueCountFrequency (%)
e 480081
12.4%
t 370357
 
9.6%
o 322144
 
8.4%
s 290297
 
7.5%
n 279124
 
7.2%
i 275200
 
7.1%
a 268785
 
7.0%
r 221407
 
5.7%
l 168687
 
4.4%
u 143745
 
3.7%
Other values (79) 1037363
26.9%
Uppercase Letter
ValueCountFrequency (%)
I 50866
36.0%
A 29590
20.9%
S 11706
 
8.3%
O 8665
 
6.1%
T 5581
 
4.0%
P 3750
 
2.7%
G 3534
 
2.5%
C 3145
 
2.2%
M 2911
 
2.1%
N 2698
 
1.9%
Other values (32) 18804
 
13.3%
Other Punctuation
ValueCountFrequency (%)
. 43248
44.9%
, 28386
29.5%
' 12905
 
13.4%
/ 3917
 
4.1%
" 2973
 
3.1%
? 1037
 
1.1%
! 1033
 
1.1%
: 974
 
1.0%
; 666
 
0.7%
& 441
 
0.5%
Other values (13) 683
 
0.7%
Decimal Number
ValueCountFrequency (%)
0 476
24.3%
1 453
23.2%
2 315
16.1%
3 182
 
9.3%
4 171
 
8.7%
5 132
 
6.8%
9 90
 
4.6%
8 51
 
2.6%
6 48
 
2.5%
7 37
 
1.9%
Nonspacing Mark
ValueCountFrequency (%)
13
39.4%
9
27.3%
2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other Symbol
ValueCountFrequency (%)
82
85.4%
7
 
7.3%
2
 
2.1%
2
 
2.1%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Math Symbol
ValueCountFrequency (%)
+ 118
48.6%
= 53
21.8%
> 41
 
16.9%
| 14
 
5.8%
< 11
 
4.5%
~ 6
 
2.5%
Spacing Mark
ValueCountFrequency (%)
ি 10
40.0%
9
36.0%
3
 
12.0%
2
 
8.0%
1
 
4.0%
Modifier Symbol
ValueCountFrequency (%)
´ 16
33.3%
` 15
31.2%
¯ 12
25.0%
^ 5
 
10.4%
Dash Punctuation
ValueCountFrequency (%)
- 4294
99.1%
28
 
0.6%
11
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 2642
98.8%
[ 26
 
1.0%
6
 
0.2%
Final Punctuation
ValueCountFrequency (%)
1004
93.0%
74
 
6.9%
» 2
 
0.2%
Initial Punctuation
ValueCountFrequency (%)
78
75.7%
23
 
22.3%
« 2
 
1.9%
Format
ValueCountFrequency (%)
14
56.0%
9
36.0%
2
 
8.0%
Space Separator
ValueCountFrequency (%)
798791
> 99.9%
  7
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2919
99.1%
] 27
 
0.9%
Connector Punctuation
ValueCountFrequency (%)
_ 113
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 4
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3997454
81.4%
Common 908673
 
18.5%
Cyrillic 985
 
< 0.1%
Arabic 654
 
< 0.1%
Han 355
 
< 0.1%
Bengali 109
 
< 0.1%
Thai 27
 
< 0.1%
Inherited 22
 
< 0.1%
Hangul 13
 
< 0.1%
Katakana 6
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
19
 
5.4%
13
 
3.7%
8
 
2.3%
7
 
2.0%
6
 
1.7%
5
 
1.4%
5
 
1.4%
5
 
1.4%
5
 
1.4%
4
 
1.1%
Other values (190) 278
78.3%
Latin
ValueCountFrequency (%)
e 480081
12.0%
t 370357
 
9.3%
o 322144
 
8.1%
s 290297
 
7.3%
n 279124
 
7.0%
i 275200
 
6.9%
a 268785
 
6.7%
r 221407
 
5.5%
l 168687
 
4.2%
u 143745
 
3.6%
Other values (77) 1177627
29.5%
Common
ValueCountFrequency (%)
798791
87.9%
. 43248
 
4.8%
, 28386
 
3.1%
' 12905
 
1.4%
- 4294
 
0.5%
/ 3917
 
0.4%
" 2973
 
0.3%
) 2919
 
0.3%
( 2642
 
0.3%
? 1037
 
0.1%
Other values (61) 7561
 
0.8%
Cyrillic
ValueCountFrequency (%)
о 135
 
13.7%
е 79
 
8.0%
т 74
 
7.5%
а 66
 
6.7%
и 66
 
6.7%
н 57
 
5.8%
в 44
 
4.5%
с 41
 
4.2%
л 41
 
4.2%
р 37
 
3.8%
Other values (33) 345
35.0%
Arabic
ValueCountFrequency (%)
ا 98
15.0%
ل 69
 
10.6%
م 61
 
9.3%
ت 49
 
7.5%
ي 48
 
7.3%
و 39
 
6.0%
ه 26
 
4.0%
ن 25
 
3.8%
ع 24
 
3.7%
ف 22
 
3.4%
Other values (25) 193
29.5%
Bengali
ValueCountFrequency (%)
ি 10
 
9.2%
9
 
8.3%
9
 
8.3%
8
 
7.3%
7
 
6.4%
6
 
5.5%
6
 
5.5%
5
 
4.6%
4
 
3.7%
4
 
3.7%
Other values (24) 41
37.6%
Thai
ValueCountFrequency (%)
3
 
11.1%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (9) 9
33.3%
Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Hebrew
ValueCountFrequency (%)
ט 1
33.3%
ק 1
33.3%
ד 1
33.3%
Inherited
ValueCountFrequency (%)
13
59.1%
9
40.9%
Katakana
ValueCountFrequency (%)
6
100.0%
Hiragana
ValueCountFrequency (%)
1
100.0%
Greek
ValueCountFrequency (%)
Ι 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4904481
99.9%
Punctuation 1275
 
< 0.1%
Cyrillic 985
 
< 0.1%
Arabic 669
 
< 0.1%
CJK 355
 
< 0.1%
None 263
 
< 0.1%
Bengali 109
 
< 0.1%
Specials 82
 
< 0.1%
Thai 27
 
< 0.1%
VS 13
 
< 0.1%
Other values (9) 44
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
798791
16.3%
e 480081
 
9.8%
t 370357
 
7.6%
o 322144
 
6.6%
s 290297
 
5.9%
n 279124
 
5.7%
i 275200
 
5.6%
a 268785
 
5.5%
r 221407
 
4.5%
l 168687
 
3.4%
Other values (84) 1429608
29.1%
Punctuation
ValueCountFrequency (%)
1004
78.7%
78
 
6.1%
74
 
5.8%
28
 
2.2%
25
 
2.0%
23
 
1.8%
14
 
1.1%
11
 
0.9%
9
 
0.7%
6
 
0.5%
Other values (2) 3
 
0.2%
Cyrillic
ValueCountFrequency (%)
о 135
 
13.7%
е 79
 
8.0%
т 74
 
7.5%
а 66
 
6.7%
и 66
 
6.7%
н 57
 
5.8%
в 44
 
4.5%
с 41
 
4.2%
л 41
 
4.2%
р 37
 
3.8%
Other values (33) 345
35.0%
Arabic
ValueCountFrequency (%)
ا 98
14.6%
ل 69
 
10.3%
م 61
 
9.1%
ت 49
 
7.3%
ي 48
 
7.2%
و 39
 
5.8%
ه 26
 
3.9%
ن 25
 
3.7%
ع 24
 
3.6%
ف 22
 
3.3%
Other values (26) 208
31.1%
Specials
ValueCountFrequency (%)
82
100.0%
None
ValueCountFrequency (%)
ó 32
12.2%
í 30
11.4%
é 25
 
9.5%
á 23
 
8.7%
ı 18
 
6.8%
´ 16
 
6.1%
14
 
5.3%
¯ 12
 
4.6%
ã 10
 
3.8%
ú 10
 
3.8%
Other values (30) 73
27.8%
CJK
ValueCountFrequency (%)
19
 
5.4%
13
 
3.7%
8
 
2.3%
7
 
2.0%
6
 
1.7%
5
 
1.4%
5
 
1.4%
5
 
1.4%
5
 
1.4%
4
 
1.1%
Other values (190) 278
78.3%
VS
ValueCountFrequency (%)
13
100.0%
Bengali
ValueCountFrequency (%)
ি 10
 
9.2%
9
 
8.3%
9
 
8.3%
8
 
7.3%
7
 
6.4%
6
 
5.5%
6
 
5.5%
5
 
4.6%
4
 
3.7%
4
 
3.7%
Other values (24) 41
37.6%
Misc Symbols
ValueCountFrequency (%)
7
77.8%
2
 
22.2%
Katakana
ValueCountFrequency (%)
6
100.0%
Thai
ValueCountFrequency (%)
3
 
11.1%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (9) 9
33.3%
Dingbats
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Latin Ext Additional
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Hiragana
ValueCountFrequency (%)
1
100.0%
Devanagari
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Hebrew
ValueCountFrequency (%)
ט 1
33.3%
ק 1
33.3%
ד 1
33.3%

AISelect
Text

MISSING 

Distinct3
Distinct (%)< 0.1%
Missing1211
Missing (%)1.4%
Memory size1.4 MiB

Length

Max length23
Median length22
Mean length13.86594751
Min length3

Characters and Unicode

Total characters1219829
Distinct characters17
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYes
2nd rowNo, and I don't plan to
3rd rowNo, and I don't plan to
4th rowYes
5th rowYes
ValueCountFrequency (%)
no 48931
14.7%
i 48931
14.7%
plan 48931
14.7%
to 48931
14.7%
yes 39042
11.7%
and 26221
7.9%
don't 26221
7.9%
but 22710
6.8%
soon 22710
6.8%

Most occurring characters

ValueCountFrequency (%)
244655
20.1%
o 169503
13.9%
n 124083
10.2%
t 97862
 
8.0%
a 75152
 
6.2%
s 61752
 
5.1%
d 52442
 
4.3%
N 48931
 
4.0%
, 48931
 
4.0%
l 48931
 
4.0%
Other values (7) 247587
20.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 763118
62.6%
Space Separator 244655
 
20.1%
Uppercase Letter 136904
 
11.2%
Other Punctuation 75152
 
6.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 169503
22.2%
n 124083
16.3%
t 97862
12.8%
a 75152
9.8%
s 61752
 
8.1%
d 52442
 
6.9%
l 48931
 
6.4%
p 48931
 
6.4%
e 39042
 
5.1%
b 22710
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
N 48931
35.7%
I 48931
35.7%
Y 39042
28.5%
Other Punctuation
ValueCountFrequency (%)
, 48931
65.1%
' 26221
34.9%
Space Separator
ValueCountFrequency (%)
244655
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 900022
73.8%
Common 319807
 
26.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 169503
18.8%
n 124083
13.8%
t 97862
10.9%
a 75152
8.4%
s 61752
 
6.9%
d 52442
 
5.8%
N 48931
 
5.4%
l 48931
 
5.4%
I 48931
 
5.4%
p 48931
 
5.4%
Other values (4) 123504
13.7%
Common
ValueCountFrequency (%)
244655
76.5%
, 48931
 
15.3%
' 26221
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1219829
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
244655
20.1%
o 169503
13.9%
n 124083
10.2%
t 97862
 
8.0%
a 75152
 
6.2%
s 61752
 
5.1%
d 52442
 
4.3%
N 48931
 
4.0%
, 48931
 
4.0%
l 48931
 
4.0%
Other values (7) 247587
20.3%

AISent
Text

MISSING 

Distinct6
Distinct (%)< 0.1%
Missing27683
Missing (%)31.0%
Memory size1.4 MiB

Length

Max length16
Median length14
Mean length10.68177753
Min length6

Characters and Unicode

Total characters656940
Distinct characters20
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row: Indifferent
2nd row: Very favorable
3rd row: Favorable
4th row: Unfavorable
5th row: Favorable

Value | Count | Frequency (%)
favorable | 46913 | 59.5%
very | 17322 | 22.0%
indifferent | 10147 | 12.9%
unsure | 2471 | 3.1%
unfavorable | 1970 | 2.5%

Most occurring characters

ValueCountFrequency (%)
a 97766
14.9%
e 88970
13.5%
r 78823
12.0%
v 48883
7.4%
o 48883
7.4%
b 48883
7.4%
l 48883
7.4%
f 39314
 
6.0%
F 29863
 
4.5%
n 24735
 
3.8%
Other values (10) 101937
15.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 578117
88.0%
Uppercase Letter 61501
 
9.4%
Space Separator 17322
 
2.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 97766
16.9%
e 88970
15.4%
r 78823
13.6%
v 48883
8.5%
o 48883
8.5%
b 48883
8.5%
l 48883
8.5%
f 39314
6.8%
n 24735
 
4.3%
y 17322
 
3.0%
Other values (5) 35655
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
F 29863
48.6%
V 17322
28.2%
I 10147
 
16.5%
U 4169
 
6.8%
Space Separator
ValueCountFrequency (%)
17322
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 639618
97.4%
Common 17322
 
2.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 97766
15.3%
e 88970
13.9%
r 78823
12.3%
v 48883
7.6%
o 48883
7.6%
b 48883
7.6%
l 48883
7.6%
f 39314
6.1%
F 29863
 
4.7%
n 24735
 
3.9%
Other values (9) 84615
13.2%
Common
ValueCountFrequency (%)
17322
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 656940
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 97766
14.9%
e 88970
13.5%
r 78823
12.0%
v 48883
7.4%
o 48883
7.4%
b 48883
7.4%
l 48883
7.4%
f 39314
 
6.0%
F 29863
 
4.5%
n 24735
 
3.8%
Other values (10) 101937
15.5%

AIAcc
Text

MISSING 

Distinct: 60
Distinct (%): 0.2%
Missing: 50590
Missing (%): 56.7%
Memory size: 1.4 MiB

Length

Max length: 130
Median length: 108
Mean length: 53.95934601
Min length: 17

Characters and Unicode

Total characters: 2082507
Distinct characters: 29
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3
Unique (%): < 0.1%

Sample

1st row: Other (please explain)
2nd row: Increase productivity;Greater efficiency;Speed up learning;Improve accuracy in coding
3rd row: Greater efficiency
4th row: Improve accuracy in coding
5th row: Increase productivity;Speed up learning

Value | Count | Frequency (%)
increase | 31202 | 16.1%
up | 24938 | 12.9%
productivity;greater | 21877 | 11.3%
efficiency;speed | 16168 | 8.4%
learning | 14106 | 7.3%
accuracy | 13189 | 6.8%
in | 13189 | 6.8%
coding | 10865 | 5.6%
learning;improve | 10832 | 5.6%
productivity;speed | 6430 | 3.3%
Other values (16) | 30472 | 15.8%
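
The word counts above are split on whitespace, which is why fused tokens such as "productivity;greater" appear. Judging from the Sample rows, the responses look like semicolon-delimited multi-select answers, so counting whole options may be more informative. The following is a minimal sketch under that assumption; the helper name `option_counts` and the use of `None` for a missing response are illustrative choices, and the input is just the five Sample rows.

```python
from collections import Counter


def option_counts(responses, sep=";"):
    """Count each selected option across multi-select responses."""
    counts = Counter()
    for response in responses:
        if not response:  # skip missing or empty responses
            continue
        # split one response into its selected options and count each once
        counts.update(part.strip() for part in response.split(sep))
    return counts


sample = [
    "Other (please explain)",
    "Increase productivity;Greater efficiency;Speed up learning;Improve accuracy in coding",
    "Greater efficiency",
    "Improve accuracy in coding",
    "Increase productivity;Speed up learning",
    None,
]
counts = option_counts(sample)
for option, n in counts.most_common():
    print(option, n)
```

Applied to the full column, a count of this kind would give one row per answer option instead of one row per whitespace-separated token.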

Most occurring characters

ValueCountFrequency (%)
e 264222
12.7%
r 175385
 
8.4%
i 171664
 
8.2%
c 170973
 
8.2%
154674
 
7.4%
n 139354
 
6.7%
a 120268
 
5.8%
p 103557
 
5.0%
t 95609
 
4.6%
o 73771
 
3.5%
Other values (19) 613030
29.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1759711
84.5%
Space Separator 154674
 
7.4%
Uppercase Letter 101227
 
4.9%
Other Punctuation 62633
 
3.0%
Open Punctuation 2131
 
0.1%
Close Punctuation 2131
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 264222
15.0%
r 175385
10.0%
i 171664
9.8%
c 170973
9.7%
n 139354
 
7.9%
a 120268
 
6.8%
p 103557
 
5.9%
t 95609
 
5.4%
o 73771
 
4.2%
d 70636
 
4.0%
Other values (11) 374272
21.3%
Uppercase Letter
ValueCountFrequency (%)
I 49419
48.8%
S 24938
24.6%
G 24739
24.4%
O 2131
 
2.1%
Space Separator
ValueCountFrequency (%)
154674
100.0%
Other Punctuation
ValueCountFrequency (%)
; 62633
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2131
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2131
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1860938
89.4%
Common 221569
 
10.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 264222
14.2%
r 175385
 
9.4%
i 171664
 
9.2%
c 170973
 
9.2%
n 139354
 
7.5%
a 120268
 
6.5%
p 103557
 
5.6%
t 95609
 
5.1%
o 73771
 
4.0%
d 70636
 
3.8%
Other values (15) 475499
25.6%
Common
ValueCountFrequency (%)
154674
69.8%
; 62633
28.3%
( 2131
 
1.0%
) 2131
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2082507
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 264222
12.7%
r 175385
 
8.4%
i 171664
 
8.2%
c 170973
 
8.2%
154674
 
7.4%
n 139354
 
6.7%
a 120268
 
5.8%
p 103557
 
5.0%
t 95609
 
4.6%
o 73771
 
3.5%
Other values (19) 613030
29.4%

AIBen
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 27788
Missing (%): 31.2%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 18.33060786
Min length: 12

Characters and Unicode

Total characters: 1125426
Distinct characters: 20
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Somewhat distrust
2nd row: Somewhat trust
3rd row: Somewhat trust
4th row: Somewhat distrust
5th row: Somewhat distrust

Value | Count | Frequency (%)
trust | 44716 | 27.9%
somewhat | 37458 | 23.3%
distrust | 35517 | 22.1%
neither | 18837 | 11.7%
nor | 18837 | 11.7%
highly | 5101 | 3.2%

Most occurring characters

ValueCountFrequency (%)
t 216761
19.3%
r 117907
10.5%
s 115750
10.3%
99070
8.8%
u 80233
 
7.1%
e 75132
 
6.7%
h 61396
 
5.5%
i 59455
 
5.3%
o 56295
 
5.0%
S 37458
 
3.3%
Other values (10) 205969
18.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 964960
85.7%
Space Separator 99070
 
8.8%
Uppercase Letter 61396
 
5.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 216761
22.5%
r 117907
12.2%
s 115750
12.0%
u 80233
 
8.3%
e 75132
 
7.8%
h 61396
 
6.4%
i 59455
 
6.2%
o 56295
 
5.8%
a 37458
 
3.9%
w 37458
 
3.9%
Other values (6) 107115
11.1%
Uppercase Letter
ValueCountFrequency (%)
S 37458
61.0%
N 18837
30.7%
H 5101
 
8.3%
Space Separator
ValueCountFrequency (%)
99070
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1026356
91.2%
Common 99070
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 216761
21.1%
r 117907
11.5%
s 115750
11.3%
u 80233
 
7.8%
e 75132
 
7.3%
h 61396
 
6.0%
i 59455
 
5.8%
o 56295
 
5.5%
S 37458
 
3.6%
a 37458
 
3.6%
Other values (9) 168511
16.4%
Common
ValueCountFrequency (%)
99070
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1125426
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 216761
19.3%
r 117907
10.5%
s 115750
10.3%
99070
8.8%
u 80233
 
7.1%
e 75132
 
6.7%
h 61396
 
5.5%
i 59455
 
5.3%
o 56295
 
5.0%
S 37458
 
3.3%
Other values (10) 205969
18.3%
Distinct: 640
Distinct (%): 2.0%
Missing: 56401
Missing (%): 63.2%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 165
Mean length: 96.66592441
Min length: 12

Characters and Unicode

Total characters: 3168999
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 90
Unique (%): 0.3%

Sample

1st row: Learning about a codebase;Writing code;Debugging and getting help
2nd row: Project planning;Testing code;Committing and reviewing code;Deployment and monitoring;Collaborating with teammates
3rd row: Learning about a codebase;Documenting code;Testing code;Committing and reviewing code;Deployment and monitoring;Collaborating with teammates
4th row: Project planning;Writing code;Documenting code;Debugging and getting help;Testing code;Committing and reviewing code
5th row: Learning about a codebase;Writing code;Documenting code;Deployment and monitoring

Value | Count | Frequency (%)
and | 51142 | 16.4%
reviewing | 18670 | 6.0%
learning | 18467 | 5.9%
a | 18467 | 5.9%
about | 18467 | 5.9%
code;deployment | 16014 | 5.1%
getting | 15335 | 4.9%
code;committing | 15186 | 4.9%
help;testing | 11559 | 3.7%
code;debugging | 11395 | 3.7%
Other values (45) | 116309 | 37.4%

Most occurring characters

ValueCountFrequency (%)
n 330046
 
10.4%
e 318713
 
10.1%
289408
 
9.1%
i 253163
 
8.0%
o 230471
 
7.3%
t 229488
 
7.2%
g 224155
 
7.1%
a 185050
 
5.8%
d 137262
 
4.3%
c 119599
 
3.8%
Other values (22) 851644
26.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2622940
82.8%
Space Separator 289408
 
9.1%
Uppercase Letter 144431
 
4.6%
Other Punctuation 111648
 
3.5%
Open Punctuation 286
 
< 0.1%
Close Punctuation 286
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 330046
12.6%
e 318713
12.2%
i 253163
9.7%
o 230471
8.8%
t 229488
8.7%
g 224155
8.5%
a 185050
 
7.1%
d 137262
 
5.2%
c 119599
 
4.6%
m 113169
 
4.3%
Other values (11) 481824
18.4%
Uppercase Letter
ValueCountFrequency (%)
D 51417
35.6%
C 29975
20.8%
T 20807
14.4%
L 18467
 
12.8%
P 14534
 
10.1%
W 8945
 
6.2%
O 286
 
0.2%
Space Separator
ValueCountFrequency (%)
289408
100.0%
Other Punctuation
ValueCountFrequency (%)
; 111648
100.0%
Open Punctuation
ValueCountFrequency (%)
( 286
100.0%
Close Punctuation
ValueCountFrequency (%)
) 286
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2767371
87.3%
Common 401628
 
12.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 330046
11.9%
e 318713
11.5%
i 253163
9.1%
o 230471
 
8.3%
t 229488
 
8.3%
g 224155
 
8.1%
a 185050
 
6.7%
d 137262
 
5.0%
c 119599
 
4.3%
m 113169
 
4.1%
Other values (18) 626255
22.6%
Common
ValueCountFrequency (%)
289408
72.1%
; 111648
 
27.8%
( 286
 
0.1%
) 286
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3168999
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 330046
 
10.4%
e 318713
 
10.1%
289408
 
9.1%
i 253163
 
8.0%
o 230471
 
7.3%
t 229488
 
7.2%
g 224155
 
7.1%
a 185050
 
5.8%
d 137262
 
4.3%
c 119599
 
3.8%
Other values (22) 851644
26.9%

AIToolCurrently Using
Text

MISSING 

Distinct: 533
Distinct (%): 1.5%
Missing: 53047
Missing (%): 59.5%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 183
Mean length: 49.84805047
Min length: 12

Characters and Unicode

Total characters: 1801359
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 134
Unique (%): 0.4%

Sample

1st row: Writing code;Committing and reviewing code
2nd row: Learning about a codebase;Writing code;Documenting code;Debugging and getting help
3rd row: Writing code;Debugging and getting help
4th row: Writing code;Debugging and getting help
5th row: Project planning;Writing code;Debugging and getting help

Value | Count | Frequency (%)
and | 24031 | 11.9%
code | 21205 | 10.5%
writing | 19882 | 9.8%
getting | 18437 | 9.1%
code;debugging | 16301 | 8.0%
code;documenting | 11681 | 5.8%
a | 11350 | 5.6%
about | 11350 | 5.6%
learning | 11350 | 5.6%
help | 10964 | 5.4%
Other values (45) | 46103 | 22.7%

Most occurring characters

ValueCountFrequency (%)
e 190158
10.6%
n 179306
10.0%
g 172503
 
9.6%
167835
 
9.3%
i 159679
 
8.9%
t 123690
 
6.9%
o 109584
 
6.1%
d 92860
 
5.2%
c 86889
 
4.8%
a 80615
 
4.5%
Other values (22) 438240
24.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1477447
82.0%
Space Separator 167835
 
9.3%
Uppercase Letter 95528
 
5.3%
Other Punctuation 59391
 
3.3%
Open Punctuation 579
 
< 0.1%
Close Punctuation 579
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 190158
12.9%
n 179306
12.1%
g 172503
11.7%
i 159679
10.8%
t 123690
8.4%
o 109584
7.4%
d 92860
6.3%
c 86889
 
5.9%
a 80615
 
5.5%
r 55707
 
3.8%
Other values (11) 226456
15.3%
Uppercase Letter
ValueCountFrequency (%)
D 33188
34.7%
W 31131
32.6%
L 11350
 
11.9%
T 9000
 
9.4%
C 5183
 
5.4%
P 5097
 
5.3%
O 579
 
0.6%
Space Separator
ValueCountFrequency (%)
167835
100.0%
Other Punctuation
ValueCountFrequency (%)
; 59391
100.0%
Open Punctuation
ValueCountFrequency (%)
( 579
100.0%
Close Punctuation
ValueCountFrequency (%)
) 579
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1572975
87.3%
Common 228384
 
12.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 190158
12.1%
n 179306
11.4%
g 172503
11.0%
i 159679
10.2%
t 123690
 
7.9%
o 109584
 
7.0%
d 92860
 
5.9%
c 86889
 
5.5%
a 80615
 
5.1%
r 55707
 
3.5%
Other values (18) 321984
20.5%
Common
ValueCountFrequency (%)
167835
73.5%
; 59391
 
26.0%
( 579
 
0.3%
) 579
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1801359
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 190158
10.6%
n 179306
10.0%
g 172503
 
9.6%
167835
 
9.3%
i 159679
 
8.9%
t 123690
 
6.9%
o 109584
 
6.1%
d 92860
 
5.2%
c 86889
 
4.8%
a 80615
 
4.5%
Other values (22) 438240
24.3%
Distinct: 535
Distinct (%): 2.5%
Missing: 68115
Missing (%): 76.4%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 192
Mean length: 71.45574066
Min length: 12

Characters and Unicode

Total characters: 1505501
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 110
Unique (%): 0.5%

Sample

1st row: Project planning
2nd row: Learning about a codebase;Project planning;Writing code;Documenting code;Testing code;Committing and reviewing code;Deployment and monitoring;Collaborating with teammates
3rd row: Project planning;Testing code;Deployment and monitoring;Collaborating with teammates
4th row: Project planning;Documenting code;Committing and reviewing code;Collaborating with teammates
5th row: Deployment and monitoring

Value | Count | Frequency (%)
and | 21737 | 15.2%
teammates | 15606 | 10.9%
with | 15606 | 10.9%
monitoring;collaborating | 9239 | 6.5%
reviewing | 8654 | 6.1%
project | 7915 | 5.5%
code;deployment | 7359 | 5.2%
about | 4936 | 3.5%
a | 4936 | 3.5%
learning | 4936 | 3.5%
Other values (44) | 41826 | 29.3%

Most occurring characters

ValueCountFrequency (%)
n 147142
 
9.8%
137032
 
9.1%
t 131410
 
8.7%
e 129687
 
8.6%
i 119196
 
7.9%
a 115433
 
7.7%
o 113755
 
7.6%
g 80812
 
5.4%
m 72926
 
4.8%
l 55823
 
3.7%
Other values (22) 402285
26.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1263226
83.9%
Space Separator 137032
 
9.1%
Uppercase Letter 62855
 
4.2%
Other Punctuation 41786
 
2.8%
Open Punctuation 301
 
< 0.1%
Close Punctuation 301
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 147142
11.6%
t 131410
10.4%
e 129687
10.3%
i 119196
9.4%
a 115433
9.1%
o 113755
9.0%
g 80812
 
6.4%
m 72926
 
5.8%
l 55823
 
4.4%
r 53397
 
4.2%
Other values (11) 243645
19.3%
Uppercase Letter
ValueCountFrequency (%)
C 24260
38.6%
D 16125
25.7%
P 11227
17.9%
L 4936
 
7.9%
T 4316
 
6.9%
W 1690
 
2.7%
O 301
 
0.5%
Space Separator
ValueCountFrequency (%)
137032
100.0%
Other Punctuation
ValueCountFrequency (%)
; 41786
100.0%
Open Punctuation
ValueCountFrequency (%)
( 301
100.0%
Close Punctuation
ValueCountFrequency (%)
) 301
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1326081
88.1%
Common 179420
 
11.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 147142
11.1%
t 131410
9.9%
e 129687
9.8%
i 119196
 
9.0%
a 115433
 
8.7%
o 113755
 
8.6%
g 80812
 
6.1%
m 72926
 
5.5%
l 55823
 
4.2%
r 53397
 
4.0%
Other values (18) 306500
23.1%
Common
ValueCountFrequency (%)
137032
76.4%
; 41786
 
23.3%
( 301
 
0.2%
) 301
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1505501
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 147142
 
9.8%
137032
 
9.1%
t 131410
 
8.7%
e 129687
 
8.6%
i 119196
 
7.9%
a 115433
 
7.7%
o 113755
 
7.6%
g 80812
 
5.4%
m 72926
 
4.8%
l 55823
 
3.7%
Other values (22) 402285
26.7%

AINextVery different
Text

MISSING 

Distinct: 349
Distinct (%): 2.8%
Missing: 76523
Missing (%): 85.8%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 172
Mean length: 41.31893215
Min length: 12

Characters and Unicode

Total characters: 523139
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 110
Unique (%): 0.9%

Sample

1st row: Writing code;Documenting code;Debugging and getting help
2nd row: Writing code;Debugging and getting help
3rd row: Learning about a codebase;Documenting code
4th row: Learning about a codebase;Project planning;Writing code
5th row: Debugging and getting help;Testing code

Value | Count | Frequency (%)
and | 7535 | 12.2%
code | 7045 | 11.4%
getting | 5857 | 9.5%
writing | 4241 | 6.9%
code;debugging | 3859 | 6.3%
help | 3788 | 6.2%
about | 3428 | 5.6%
a | 3428 | 5.6%
learning | 3428 | 5.6%
code;documenting | 2350 | 3.8%
Other values (45) | 16577 | 26.9%

Most occurring characters

ValueCountFrequency (%)
e 56758
10.8%
n 53436
10.2%
g 51508
9.8%
49254
 
9.4%
i 43730
 
8.4%
t 35718
 
6.8%
o 31799
 
6.1%
d 26369
 
5.0%
c 24912
 
4.8%
a 24276
 
4.6%
Other values (22) 125379
24.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 432390
82.7%
Space Separator 49254
 
9.4%
Uppercase Letter 27007
 
5.2%
Other Punctuation 14346
 
2.7%
Open Punctuation 71
 
< 0.1%
Close Punctuation 71
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 56758
13.1%
n 53436
12.4%
g 51508
11.9%
i 43730
10.1%
t 35718
8.3%
o 31799
7.4%
d 26369
 
6.1%
c 24912
 
5.8%
a 24276
 
5.6%
u 13973
 
3.2%
Other values (11) 69911
16.2%
Uppercase Letter
ValueCountFrequency (%)
D 11079
41.0%
W 6508
24.1%
L 3428
 
12.7%
T 2995
 
11.1%
C 1536
 
5.7%
P 1390
 
5.1%
O 71
 
0.3%
Space Separator
ValueCountFrequency (%)
49254
100.0%
Other Punctuation
ValueCountFrequency (%)
; 14346
100.0%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Close Punctuation
ValueCountFrequency (%)
) 71
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 459397
87.8%
Common 63742
 
12.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 56758
12.4%
n 53436
11.6%
g 51508
11.2%
i 43730
9.5%
t 35718
 
7.8%
o 31799
 
6.9%
d 26369
 
5.7%
c 24912
 
5.4%
a 24276
 
5.3%
u 13973
 
3.0%
Other values (18) 96918
21.1%
Common
ValueCountFrequency (%)
49254
77.3%
; 14346
 
22.5%
( 71
 
0.1%
) 71
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 523139
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 56758
10.8%
n 53436
10.2%
g 51508
9.8%
49254
 
9.4%
i 43730
 
8.4%
t 35718
 
6.8%
o 31799
 
6.1%
d 26369
 
5.0%
c 24912
 
4.8%
a 24276
 
4.6%
Other values (22) 125379
24.0%
Distinct: 222
Distinct (%): 3.4%
Missing: 82585
Missing (%): 92.6%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 198
Mean length: 27.54523413
Min length: 12

Characters and Unicode

Total characters: 181771
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 91
Unique (%): 1.4%

Sample

1st row: Writing code
2nd row: Writing code
3rd row: Learning about a codebase;Writing code;Documenting code;Debugging and getting help
4th row: Documenting code
5th row: Writing code

Value | Count | Frequency (%)
code | 3913 | 16.7%
writing | 2636 | 11.2%
and | 2532 | 10.8%
getting | 1668 | 7.1%
help | 1270 | 5.4%
about | 1085 | 4.6%
a | 1085 | 4.6%
learning | 1085 | 4.6%
debugging | 915 | 3.9%
code;debugging | 659 | 2.8%
Other values (41) | 6649 | 28.3%

Most occurring characters

ValueCountFrequency (%)
e 19073
10.5%
n 18437
10.1%
17149
9.4%
g 16857
 
9.3%
i 16649
 
9.2%
t 12957
 
7.1%
o 11618
 
6.4%
d 9359
 
5.1%
c 8611
 
4.7%
a 8551
 
4.7%
Other values (22) 42510
23.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 151849
83.5%
Space Separator 17149
 
9.4%
Uppercase Letter 9648
 
5.3%
Other Punctuation 3049
 
1.7%
Open Punctuation 38
 
< 0.1%
Close Punctuation 38
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 19073
12.6%
n 18437
12.1%
g 16857
11.1%
i 16649
11.0%
t 12957
8.5%
o 11618
7.7%
d 9359
 
6.2%
c 8611
 
5.7%
a 8551
 
5.6%
r 5949
 
3.9%
Other values (11) 23788
15.7%
Uppercase Letter
ValueCountFrequency (%)
D 3136
32.5%
W 3060
31.7%
L 1085
 
11.2%
T 890
 
9.2%
C 834
 
8.6%
P 605
 
6.3%
O 38
 
0.4%
Space Separator
ValueCountFrequency (%)
17149
100.0%
Other Punctuation
ValueCountFrequency (%)
; 3049
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 161497
88.8%
Common 20274
 
11.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 19073
11.8%
n 18437
11.4%
g 16857
10.4%
i 16649
10.3%
t 12957
 
8.0%
o 11618
 
7.2%
d 9359
 
5.8%
c 8611
 
5.3%
a 8551
 
5.3%
r 5949
 
3.7%
Other values (18) 33436
20.7%
Common
ValueCountFrequency (%)
17149
84.6%
; 3049
 
15.0%
( 38
 
0.2%
) 38
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 181771
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 19073
10.5%
n 18437
10.1%
17149
9.4%
g 16857
 
9.3%
i 16649
 
9.2%
t 12957
 
7.1%
o 11618
 
6.4%
d 9359
 
5.1%
c 8611
 
4.7%
a 8551
 
4.7%
Other values (22) 42510
23.4%
Distinct: 197
Distinct (%): 3.2%
Missing: 82946
Missing (%): 93.0%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 185
Mean length: 28.22138506
Min length: 12

Characters and Unicode

Total characters: 176045
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 81
Unique (%): 1.3%

Sample

1st row: Debugging and getting help
2nd row: Writing code
3rd row: Writing code
4th row: Committing and reviewing code
5th row: Learning about a codebase;Writing code;Debugging and getting help

Value | Count | Frequency (%)
code | 4075 | 18.1%
writing | 3349 | 14.9%
and | 2167 | 9.6%
getting | 1595 | 7.1%
help | 1214 | 5.4%
a | 1063 | 4.7%
learning | 1063 | 4.7%
about | 1063 | 4.7%
code;debugging | 902 | 4.0%
code;documenting | 696 | 3.1%
Other values (44) | 5340 | 23.7%

Most occurring characters

ValueCountFrequency (%)
e 18382
10.4%
n 17481
9.9%
i 16825
9.6%
g 16535
9.4%
16433
9.3%
t 12432
 
7.1%
o 11290
 
6.4%
d 9553
 
5.4%
c 9028
 
5.1%
a 7563
 
4.3%
Other values (22) 40523
23.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 146146
83.0%
Space Separator 16433
 
9.3%
Uppercase Letter 9809
 
5.6%
Other Punctuation 3571
 
2.0%
Open Punctuation 43
 
< 0.1%
Close Punctuation 43
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 18382
12.6%
n 17481
12.0%
i 16825
11.5%
g 16535
11.3%
t 12432
8.5%
o 11290
7.7%
d 9553
6.5%
c 9028
6.2%
a 7563
 
5.2%
r 6285
 
4.3%
Other values (11) 20772
14.2%
Uppercase Letter
ValueCountFrequency (%)
W 3919
40.0%
D 2927
29.8%
L 1063
 
10.8%
T 823
 
8.4%
C 541
 
5.5%
P 493
 
5.0%
O 43
 
0.4%
Space Separator
ValueCountFrequency (%)
16433
100.0%
Other Punctuation
ValueCountFrequency (%)
; 3571
100.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 155955
88.6%
Common 20090
 
11.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 18382
11.8%
n 17481
11.2%
i 16825
10.8%
g 16535
10.6%
t 12432
8.0%
o 11290
 
7.2%
d 9553
 
6.1%
c 9028
 
5.8%
a 7563
 
4.8%
r 6285
 
4.0%
Other values (18) 30581
19.6%
Common
ValueCountFrequency (%)
16433
81.8%
; 3571
 
17.8%
( 43
 
0.2%
) 43
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 176045
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 18382
10.4%
n 17481
9.9%
i 16825
9.6%
g 16535
9.4%
16433
9.3%
t 12432
 
7.1%
o 11290
 
6.4%
d 9553
 
5.4%
c 9028
 
5.1%
a 7563
 
4.3%
Other values (22) 40523
23.0%

AINextVery similar
Text

MISSING 

Distinct: 159
Distinct (%): 6.1%
Missing: 86563
Missing (%): 97.1%
Memory size: 1.4 MiB
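
The two percentages in this summary can be checked by hand. The sketch below assumes that Missing (%) is taken over all 89184 observations listed in the report overview and that Distinct (%) is taken over the non-missing values only; that second denominator is an inference from the figures shown here, not something the report states. The function name `summary_percentages` is hypothetical.

```python
def summary_percentages(n_rows, n_missing, n_distinct):
    """Return (missing %, distinct %) rounded to one decimal place."""
    n_present = n_rows - n_missing  # responses that are actually filled in
    missing_pct = 100 * n_missing / n_rows
    # assumption: distinct values are expressed relative to non-missing rows
    distinct_pct = 100 * n_distinct / n_present if n_present else 0.0
    return round(missing_pct, 1), round(distinct_pct, 1)


# Figures for this variable: 89184 observations, 86563 missing, 159 distinct
result = summary_percentages(89184, 86563, 159)
print(result)
```

With these inputs only 2621 responses are non-missing, so 159 distinct values is a comparatively large share of what was answered.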

Length

Max length: 222
Median length: 183
Mean length: 30.78176269
Min length: 12

Characters and Unicode

Total characters: 80679
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 2.7%

Sample

1st row: Learning about a codebase;Debugging and getting help
2nd row: Debugging and getting help
3rd row: Writing code
4th row: Documenting code
5th row: Documenting code;Debugging and getting help;Testing code

Value | Count | Frequency (%)
code | 1705 | 17.2%
writing | 1405 | 14.2%
and | 1006 | 10.2%
getting | 644 | 6.5%
help | 433 | 4.4%
learning | 400 | 4.0%
about | 400 | 4.0%
a | 400 | 4.0%
code;debugging | 392 | 4.0%
code;documenting | 351 | 3.5%
Other values (41) | 2771 | 28.0%

Most occurring characters

ValueCountFrequency (%)
e 8379
10.4%
n 7952
9.9%
i 7650
 
9.5%
7409
 
9.2%
g 7165
 
8.9%
t 5834
 
7.2%
o 5325
 
6.6%
d 4275
 
5.3%
c 4049
 
5.0%
a 3423
 
4.2%
Other values (22) 19218
23.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 66929
83.0%
Space Separator 7409
 
9.2%
Uppercase Letter 4429
 
5.5%
Other Punctuation 1808
 
2.2%
Open Punctuation 52
 
0.1%
Close Punctuation 52
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 8379
12.5%
n 7952
11.9%
i 7650
11.4%
g 7165
10.7%
t 5834
8.7%
o 5325
8.0%
d 4275
6.4%
c 4049
6.0%
a 3423
 
5.1%
r 2890
 
4.3%
Other values (11) 9987
14.9%
Uppercase Letter
ValueCountFrequency (%)
W 1658
37.4%
D 1341
30.3%
L 400
 
9.0%
T 400
 
9.0%
C 345
 
7.8%
P 233
 
5.3%
O 52
 
1.2%
Space Separator
ValueCountFrequency (%)
7409
100.0%
Other Punctuation
ValueCountFrequency (%)
; 1808
100.0%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 71358
88.4%
Common 9321
 
11.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 8379
11.7%
n 7952
11.1%
i 7650
10.7%
g 7165
10.0%
t 5834
8.2%
o 5325
 
7.5%
d 4275
 
6.0%
c 4049
 
5.7%
a 3423
 
4.8%
r 2890
 
4.1%
Other values (18) 14416
20.2%
Common
ValueCountFrequency (%)
7409
79.5%
; 1808
 
19.4%
( 52
 
0.6%
) 52
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80679
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 8379
10.4%
n 7952
9.9%
i 7650
 
9.5%
7409
 
9.2%
g 7165
 
8.9%
t 5834
 
7.2%
o 5325
 
6.6%
d 4275
 
5.3%
c 4049
 
5.0%
a 3423
 
4.2%
Other values (22) 19218
23.8%
Distinct: 326
Distinct (%): 1.4%
Missing: 65881
Missing (%): 73.9%
Memory size: 1.4 MiB

Length

Max length: 222
Median length: 185
Mean length: 33.39119427
Min length: 12

Characters and Unicode

Total characters: 778115
Distinct characters: 32
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 92
Unique (%): 0.4%

Sample

1st row: Documenting code;Debugging and getting help
2nd row: Documenting code;Committing and reviewing code
3rd row: Writing code
4th row: Writing code;Documenting code;Debugging and getting help;Testing code
5th row: Writing code;Debugging and getting help

Value | Count | Frequency (%)
code | 14398 | 15.0%
writing | 12435 | 12.9%
and | 9940 | 10.3%
getting | 7987 | 8.3%
help | 5824 | 6.1%
code;debugging | 5482 | 5.7%
learning | 5300 | 5.5%
about | 5300 | 5.5%
a | 5300 | 5.5%
code;documenting | 3332 | 3.5%
Other values (45) | 20924 | 21.7%

Most occurring characters

ValueCountFrequency (%)
e 81841
10.5%
n 77744
10.0%
g 75755
9.7%
73314
9.4%
i 71489
 
9.2%
t 53680
 
6.9%
o 47747
 
6.1%
d 41248
 
5.3%
c 38595
 
5.0%
a 35082
 
4.5%
Other values (22) 181620
23.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 642868
82.6%
Space Separator 73314
 
9.4%
Uppercase Letter 42531
 
5.5%
Other Punctuation 19228
 
2.5%
Open Punctuation 87
 
< 0.1%
Close Punctuation 87
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 81841
12.7%
n 77744
12.1%
g 75755
11.8%
i 71489
11.1%
t 53680
8.4%
o 47747
7.4%
d 41248
6.4%
c 38595
6.0%
a 35082
 
5.5%
r 25962
 
4.0%
Other values (11) 93725
14.6%
Uppercase Letter
ValueCountFrequency (%)
W 15889
37.4%
D 13621
32.0%
L 5300
 
12.5%
T 3625
 
8.5%
P 2243
 
5.3%
C 1766
 
4.2%
O 87
 
0.2%
Space Separator
ValueCountFrequency (%)
73314
100.0%
Other Punctuation
ValueCountFrequency (%)
; 19228
100.0%
Open Punctuation
ValueCountFrequency (%)
( 87
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 685399
88.1%
Common 92716
 
11.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 81841
11.9%
n 77744
11.3%
g 75755
11.1%
i 71489
10.4%
t 53680
 
7.8%
o 47747
 
7.0%
d 41248
 
6.0%
c 38595
 
5.6%
a 35082
 
5.1%
r 25962
 
3.8%
Other values (18) 136256
19.9%
Common
ValueCountFrequency (%)
73314
79.1%
; 19228
 
20.7%
( 87
 
0.1%
) 87
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 778115
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 81841
10.5%
n 77744
10.0%
g 75755
9.7%
73314
9.4%
i 71489
 
9.2%
t 53680
 
6.9%
o 47747
 
6.1%
d 41248
 
5.3%
c 38595
 
5.0%
a 35082
 
4.5%
Other values (22) 181620
23.3%

TBranch
Text

MISSING 

Distinct: 2
Distinct (%): < 0.1%
Missing: 23416
Missing (%): 26.3%
Memory size: 1.4 MiB

Length

Max length: 3
Median length: 3
Mean length: 2.667072132
Min length: 2
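
These length statistics follow directly from the value counts reported for this variable (43872 "Yes" and 21896 "No" responses). The sketch below recomputes them from such a value-to-count mapping; the helper name `length_stats` is hypothetical, and expanding the counts into a flat list is just one simple way to do it rather than how the report itself was generated.

```python
from statistics import mean, median


def length_stats(value_counts):
    """Compute string-length statistics from a {value: count} mapping."""
    lengths = []
    for value, count in value_counts.items():
        # repeat each value's length once per occurrence
        lengths.extend([len(value)] * count)
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


stats = length_stats({"Yes": 43872, "No": 21896})
print(stats)
```

The mean works out to (3 × 43872 + 2 × 21896) / 65768 = 175408 / 65768, which also ties the 175408 total characters reported for this variable to its 65768 non-missing responses.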

Characters and Unicode

Total characters: 175408
Distinct characters: 5
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Yes
2nd row: Yes
3rd row: Yes
4th row: Yes
5th row: Yes

Value | Count | Frequency (%)
yes | 43872 | 66.7%
no | 21896 | 33.3%

Most occurring characters

ValueCountFrequency (%)
Y 43872
25.0%
e 43872
25.0%
s 43872
25.0%
N 21896
12.5%
o 21896
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 109640
62.5%
Uppercase Letter 65768
37.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 43872
40.0%
s 43872
40.0%
o 21896
20.0%
Uppercase Letter
ValueCountFrequency (%)
Y 43872
66.7%
N 21896
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 175408
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
Y 43872
25.0%
e 43872
25.0%
s 43872
25.0%
N 21896
12.5%
o 21896
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 175408
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
Y 43872
25.0%
e 43872
25.0%
s 43872
25.0%
N 21896
12.5%
o 21896
12.5%

ICorPM
Text

MISSING 

Distinct: 2
Distinct (%): < 0.1%
Missing: 45516
Missing (%): 51.0%
Memory size: 1.4 MiB

Length

Max length: 22
Median length: 22
Mean length: 20.90391133
Min length: 14

Characters and Unicode

Total characters: 912832
Distinct characters: 19
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: People manager
2nd row: Individual contributor
3rd row: Individual contributor
4th row: Individual contributor
5th row: Individual contributor

Value | Count | Frequency (%)
individual | 37685 | 43.1%
contributor | 37685 | 43.1%
people | 5983 | 6.9%
manager | 5983 | 6.9%
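
The value counts for ICorPM appear to be tallied per lowercased word rather than per full answer, which is why "individual" and "contributor" share the count 37685. A minimal sketch of that kind of tally follows, assuming simple whitespace splitting. The function name `word_counts` and the three-row sample are hypothetical, and the report's own tokenizer may treat punctuation differently.

```python
from collections import Counter


def word_counts(values):
    """Count lowercased, whitespace-separated words across all answers."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts


# Hypothetical miniature column built from the sample rows shown above.
sample = ["People manager", "Individual contributor", "Individual contributor"]
counts = word_counts(sample)
print(counts.most_common())
```

Every two-word answer contributes one count to each of its words, so the word totals sum to twice the number of non-missing answers for this variable.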

Most occurring characters

ValueCountFrequency (%)
i 113055
12.4%
r 81353
8.9%
n 81353
8.9%
o 81353
8.9%
d 75370
 
8.3%
u 75370
 
8.3%
t 75370
 
8.3%
a 49651
 
5.4%
l 43668
 
4.8%
43668
 
4.8%
Other values (9) 192621
21.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 825496
90.4%
Space Separator 43668
 
4.8%
Uppercase Letter 43668
 
4.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 113055
13.7%
r 81353
9.9%
n 81353
9.9%
o 81353
9.9%
d 75370
9.1%
u 75370
9.1%
t 75370
9.1%
a 49651
6.0%
l 43668
 
5.3%
b 37685
 
4.6%
Other values (6) 111268
13.5%
Uppercase Letter
ValueCountFrequency (%)
I 37685
86.3%
P 5983
 
13.7%
Space Separator
ValueCountFrequency (%)
43668
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 869164
95.2%
Common 43668
 
4.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 113055
13.0%
r 81353
9.4%
n 81353
9.4%
o 81353
9.4%
d 75370
8.7%
u 75370
8.7%
t 75370
8.7%
a 49651
 
5.7%
l 43668
 
5.0%
I 37685
 
4.3%
Other values (8) 154936
17.8%
Common
ValueCountFrequency (%)
43668
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 912832
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 113055
12.4%
r 81353
8.9%
n 81353
8.9%
o 81353
8.9%
d 75370
 
8.3%
u 75370
 
8.3%
t 75370
 
8.3%
a 49651
 
5.4%
l 43668
 
4.8%
43668
 
4.8%
Other values (9) 192621
21.1%

WorkExp
Real number (ℝ)

MISSING 

Distinct: 51
Distinct (%): 0.1%
Missing: 45605
Missing (%): 51.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.40512632
Minimum: 0
Maximum: 50
Zeros: 261
Zeros (%): 0.3%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 5
Median: 9
Q3: 16
95-th percentile: 30
Maximum: 50
Range: 50
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 9.051989418
Coefficient of variation (CV): 0.7936772608
Kurtosis: 1.384979476
Mean: 11.40512632
Median Absolute Deviation (MAD): 5
Skewness: 1.235573844
Sum: 497024
Variance: 81.93851243
Monotonicity: Not monotonic
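
Several of the WorkExp figures can be cross-checked against one another with plain arithmetic. The sketch below copies the reported values, plus the 89184 observations from the dataset overview, and recomputes the derived ones. The variable names are illustrative only, and the formulas are the standard definitions rather than a confirmed description of how the report computes them.

```python
# Values copied from the report.
n_observations = 89184      # number of observations (dataset overview)
n_missing = 45605           # missing WorkExp values
total = 497024              # "Sum"
std = 9.051989418           # "Standard deviation"
q1, q3 = 5, 16              # first and third quartile
minimum, maximum = 0, 50

# Derived quantities.
n_valid = n_observations - n_missing            # 43579 non-missing answers
missing_pct = 100 * n_missing / n_observations  # share of missing answers
mean = total / n_valid                          # mean = sum / valid count
cv = std / mean                                 # coefficient of variation
variance = std ** 2                             # variance = std squared
iqr = q3 - q1                                   # interquartile range, 11
value_range = maximum - minimum                 # range, 50

print(n_valid, round(missing_pct, 1), round(mean, 4), round(cv, 4), round(variance, 4))
```

The recomputed mean, CV, variance, IQR and range agree with the figures listed above to the precision shown in the report.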
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
5 | 3299 | 3.7%
10 | 3019 | 3.4%
2 | 2796 | 3.1%
3 | 2781 | 3.1%
4 | 2529 | 2.8%
7 | 2422 | 2.7%
6 | 2329 | 2.6%
1 | 2271 | 2.5%
8 | 2232 | 2.5%
15 | 2001 | 2.2%
Other values (41) | 17900 | 20.1%
(Missing) | 45605 | 51.1%

Value | Count | Frequency (%)
0 | 261 | 0.3%
1 | 2271 | 2.5%
2 | 2796 | 3.1%
3 | 2781 | 3.1%
4 | 2529 | 2.8%

Value | Count | Frequency (%)
50 | 77 | 0.1%
49 | 4 | < 0.1%
48 | 12 | < 0.1%
47 | 17 | < 0.1%
46 | 26 | < 0.1%

Knowledge_1
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 46649
Missing (%): 52.3%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 10.52674268
Min length: 5

Characters and Unicode

Total characters: 447755
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Strongly agree
2nd row: Strongly agree
3rd row: Strongly agree
4th row: Agree
5th row: Agree

Value | Count | Frequency (%)
agree | 38907 | 55.7%
strongly | 16527 | 23.7%
disagree | 7221 | 10.3%
neither | 3593 | 5.1%
nor | 3593 | 5.1%

Most occurring characters

ValueCountFrequency (%)
e 99442
22.2%
r 69841
15.6%
g 62655
14.0%
27306
 
6.1%
a 26316
 
5.9%
t 20120
 
4.5%
o 20120
 
4.5%
n 20120
 
4.5%
A 19812
 
4.4%
y 16527
 
3.7%
Other values (8) 65496
14.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 377914
84.4%
Uppercase Letter 42535
 
9.5%
Space Separator 27306
 
6.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 99442
26.3%
r 69841
18.5%
g 62655
16.6%
a 26316
 
7.0%
t 20120
 
5.3%
o 20120
 
5.3%
n 20120
 
5.3%
y 16527
 
4.4%
l 16527
 
4.4%
i 10814
 
2.9%
Other values (3) 15432
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
A 19812
46.6%
S 16527
38.9%
N 3593
 
8.4%
D 2603
 
6.1%
Space Separator
ValueCountFrequency (%)
27306
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 420449
93.9%
Common 27306
 
6.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 99442
23.7%
r 69841
16.6%
g 62655
14.9%
a 26316
 
6.3%
t 20120
 
4.8%
o 20120
 
4.8%
n 20120
 
4.8%
A 19812
 
4.7%
y 16527
 
3.9%
l 16527
 
3.9%
Other values (7) 48969
11.6%
Common
ValueCountFrequency (%)
27306
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 447755
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 99442
22.2%
r 69841
15.6%
g 62655
14.0%
27306
 
6.1%
a 26316
 
5.9%
t 20120
 
4.5%
o 20120
 
4.5%
n 20120
 
4.5%
A 19812
 
4.4%
y 16527
 
3.7%
Other values (8) 65496
14.6%
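
The report lists word-level and character-level counts for Knowledge_1 but not the count of each full answer. Assuming the five distinct answers are "Strongly agree", "Agree", "Neither agree nor disagree", "Disagree" and "Strongly disagree" (an assumption; only some of them appear in the sample rows for this variable), the answer-level counts can be recovered from the figures above. Each answer starts with exactly one capital letter, and the lowercased words "agree" and "disagree" each occur once in the answers that contain them. The dictionary names below are illustrative.

```python
# Word counts and capital-letter counts reported for Knowledge_1.
words = {"agree": 38907, "strongly": 16527, "disagree": 7221,
         "neither": 3593, "nor": 3593}
capitals = {"A": 19812, "S": 16527, "N": 3593, "D": 2603}

# Answers identified directly by their leading capital letter.
answers = {
    "Agree": capitals["A"],
    "Neither agree nor disagree": capitals["N"],
    "Disagree": capitals["D"],
}

# "agree" occurs in Agree, Strongly agree and Neither agree nor disagree.
answers["Strongly agree"] = (
    words["agree"] - answers["Agree"] - answers["Neither agree nor disagree"]
)
# "disagree" occurs in Disagree, Strongly disagree and Neither agree nor disagree.
answers["Strongly disagree"] = (
    words["disagree"] - answers["Disagree"] - answers["Neither agree nor disagree"]
)

for answer, count in answers.items():
    print(answer, count)
```

Under that assumption the two "Strongly" answers add up to the 16527 occurrences of "strongly", and all five answers add up to 89184 - 46649 = 42535 non-missing rows, which is a useful consistency check.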

Knowledge_2
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47514
Missing (%): 53.3%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 13.0600432
Min length: 5

Characters and Unicode

Total characters: 544212
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Agree
2nd row: Neither agree nor disagree
3rd row: Strongly disagree
4th row: Strongly agree
5th row: Strongly agree

Value | Count | Frequency (%)
agree | 29320 | 35.8%
disagree | 22986 | 28.0%
neither | 10636 | 13.0%
nor | 10636 | 13.0%
strongly | 8384 | 10.2%

Most occurring characters

ValueCountFrequency (%)
e 125884
23.1%
r 81962
15.1%
g 60690
11.2%
40292
 
7.4%
a 38842
 
7.1%
i 33622
 
6.2%
s 22986
 
4.2%
o 19020
 
3.5%
t 19020
 
3.5%
n 19020
 
3.5%
Other values (8) 82874
15.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 462250
84.9%
Uppercase Letter 41670
 
7.7%
Space Separator 40292
 
7.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 125884
27.2%
r 81962
17.7%
g 60690
13.1%
a 38842
 
8.4%
i 33622
 
7.3%
s 22986
 
5.0%
o 19020
 
4.1%
t 19020
 
4.1%
n 19020
 
4.1%
d 13800
 
3.0%
Other values (3) 27404
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 13464
32.3%
N 10636
25.5%
D 9186
22.0%
S 8384
20.1%
Space Separator
ValueCountFrequency (%)
40292
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 503920
92.6%
Common 40292
 
7.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 125884
25.0%
r 81962
16.3%
g 60690
12.0%
a 38842
 
7.7%
i 33622
 
6.7%
s 22986
 
4.6%
o 19020
 
3.8%
t 19020
 
3.8%
n 19020
 
3.8%
d 13800
 
2.7%
Other values (7) 69074
13.7%
Common
ValueCountFrequency (%)
40292
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 544212
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 125884
23.1%
r 81962
15.1%
g 60690
11.2%
40292
 
7.4%
a 38842
 
7.1%
i 33622
 
6.2%
s 22986
 
4.2%
o 19020
 
3.5%
t 19020
 
3.5%
n 19020
 
3.5%
Other values (8) 82874
15.2%

Knowledge_3
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47386
Missing (%): 53.1%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 12.57974066
Min length: 5

Characters and Unicode

Total characters: 525808
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Strongly agree
2nd row: Agree
3rd row: Strongly agree
4th row: Agree
5th row: Disagree

Value | Count | Frequency (%)
agree | 32017 | 39.5%
disagree | 20877 | 25.7%
neither | 11096 | 13.7%
nor | 11096 | 13.7%
strongly | 6051 | 7.5%

Most occurring characters

ValueCountFrequency (%)
e 127980
24.3%
r 81137
15.4%
g 58945
11.2%
39339
 
7.5%
a 36029
 
6.9%
i 31973
 
6.1%
s 20877
 
4.0%
n 17147
 
3.3%
t 17147
 
3.3%
o 17147
 
3.3%
Other values (8) 78087
14.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 444671
84.6%
Uppercase Letter 41798
 
7.9%
Space Separator 39339
 
7.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 127980
28.8%
r 81137
18.2%
g 58945
13.3%
a 36029
 
8.1%
i 31973
 
7.2%
s 20877
 
4.7%
n 17147
 
3.9%
t 17147
 
3.9%
o 17147
 
3.9%
d 13091
 
2.9%
Other values (3) 23198
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
A 16865
40.3%
N 11096
26.5%
D 7786
18.6%
S 6051
 
14.5%
Space Separator
ValueCountFrequency (%)
39339
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 486469
92.5%
Common 39339
 
7.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 127980
26.3%
r 81137
16.7%
g 58945
12.1%
a 36029
 
7.4%
i 31973
 
6.6%
s 20877
 
4.3%
n 17147
 
3.5%
t 17147
 
3.5%
o 17147
 
3.5%
A 16865
 
3.5%
Other values (7) 61222
12.6%
Common
ValueCountFrequency (%)
39339
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 525808
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 127980
24.3%
r 81137
15.4%
g 58945
11.2%
39339
 
7.5%
a 36029
 
6.9%
i 31973
 
6.1%
s 20877
 
4.0%
n 17147
 
3.3%
t 17147
 
3.3%
o 17147
 
3.3%
Other values (8) 78087
14.9%

Knowledge_4
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47500
Missing (%): 53.3%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 12.20607427
Min length: 5

Characters and Unicode

Total characters: 508798
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Agree
2nd row: Agree
3rd row: Strongly agree
4th row: Agree
5th row: Neither agree nor disagree

Value | Count | Frequency (%)
agree | 34449 | 43.2%
disagree | 17950 | 22.5%
neither | 10715 | 13.4%
nor | 10715 | 13.4%
strongly | 5962 | 7.5%

Most occurring characters

ValueCountFrequency (%)
e 126228
24.8%
r 79791
15.7%
g 58361
11.5%
38107
 
7.5%
a 33421
 
6.6%
i 28665
 
5.6%
A 18978
 
3.7%
s 17950
 
3.5%
t 16677
 
3.3%
n 16677
 
3.3%
Other values (8) 73943
14.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 429007
84.3%
Uppercase Letter 41684
 
8.2%
Space Separator 38107
 
7.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 126228
29.4%
r 79791
18.6%
g 58361
13.6%
a 33421
 
7.8%
i 28665
 
6.7%
s 17950
 
4.2%
t 16677
 
3.9%
n 16677
 
3.9%
o 16677
 
3.9%
d 11921
 
2.8%
Other values (3) 22639
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
A 18978
45.5%
N 10715
25.7%
D 6029
 
14.5%
S 5962
 
14.3%
Space Separator
ValueCountFrequency (%)
38107
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 470691
92.5%
Common 38107
 
7.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 126228
26.8%
r 79791
17.0%
g 58361
12.4%
a 33421
 
7.1%
i 28665
 
6.1%
A 18978
 
4.0%
s 17950
 
3.8%
t 16677
 
3.5%
n 16677
 
3.5%
o 16677
 
3.5%
Other values (7) 57266
12.2%
Common
ValueCountFrequency (%)
38107
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 508798
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 126228
24.8%
r 79791
15.7%
g 58361
11.5%
38107
 
7.5%
a 33421
 
6.6%
i 28665
 
5.6%
A 18978
 
3.7%
s 17950
 
3.5%
t 16677
 
3.3%
n 16677
 
3.3%
Other values (8) 73943
14.5%

Knowledge_5
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47657
Missing (%): 53.4%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 5
Mean length: 10.9000891
Min length: 5

Characters and Unicode

Total characters: 452648
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Agree
2nd row: Agree
3rd row: Agree
4th row: Neither agree nor disagree
5th row: Strongly disagree

Value | Count | Frequency (%)
agree | 36945 | 51.1%
disagree | 12161 | 16.8%
strongly | 8012 | 11.1%
neither | 7579 | 10.5%
nor | 7579 | 10.5%

Most occurring characters

ValueCountFrequency (%)
e 113370
25.0%
r 72276
16.0%
g 57118
12.6%
30749
 
6.8%
a 26975
 
6.0%
A 22131
 
4.9%
i 19740
 
4.4%
t 15591
 
3.4%
n 15591
 
3.4%
o 15591
 
3.4%
Other values (8) 63516
14.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 380372
84.0%
Uppercase Letter 41527
 
9.2%
Space Separator 30749
 
6.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 113370
29.8%
r 72276
19.0%
g 57118
15.0%
a 26975
 
7.1%
i 19740
 
5.2%
t 15591
 
4.1%
n 15591
 
4.1%
o 15591
 
4.1%
s 12161
 
3.2%
d 8356
 
2.2%
Other values (3) 23603
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
A 22131
53.3%
S 8012
 
19.3%
N 7579
 
18.3%
D 3805
 
9.2%
Space Separator
ValueCountFrequency (%)
30749
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 421899
93.2%
Common 30749
 
6.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 113370
26.9%
r 72276
17.1%
g 57118
13.5%
a 26975
 
6.4%
A 22131
 
5.2%
i 19740
 
4.7%
t 15591
 
3.7%
n 15591
 
3.7%
o 15591
 
3.7%
s 12161
 
2.9%
Other values (7) 51355
12.2%
Common
ValueCountFrequency (%)
30749
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 452648
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 113370
25.0%
r 72276
16.0%
g 57118
12.6%
30749
 
6.8%
a 26975
 
6.0%
A 22131
 
4.9%
i 19740
 
4.4%
t 15591
 
3.4%
n 15591
 
3.4%
o 15591
 
3.4%
Other values (8) 63516
14.0%

Knowledge_6
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47664
Missing (%): 53.4%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 12.84674855
Min length: 5

Characters and Unicode

Total characters: 533397
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Agree
2nd row: Agree
3rd row: Neither agree nor disagree
4th row: Agree
5th row: Neither agree nor disagree

Value | Count | Frequency (%)
agree | 31652 | 38.6%
disagree | 21221 | 25.9%
neither | 11353 | 13.8%
nor | 11353 | 13.8%
strongly | 6420 | 7.8%

Most occurring characters

ValueCountFrequency (%)
e 128452
24.1%
r 81999
15.4%
g 59293
11.1%
40479
 
7.6%
a 37702
 
7.1%
i 32574
 
6.1%
s 21221
 
4.0%
n 17773
 
3.3%
t 17773
 
3.3%
o 17773
 
3.3%
Other values (8) 78358
14.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 451398
84.6%
Uppercase Letter 41520
 
7.8%
Space Separator 40479
 
7.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 128452
28.5%
r 81999
18.2%
g 59293
13.1%
a 37702
 
8.4%
i 32574
 
7.2%
s 21221
 
4.7%
n 17773
 
3.9%
t 17773
 
3.9%
o 17773
 
3.9%
d 12645
 
2.8%
Other values (3) 24193
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
A 15171
36.5%
N 11353
27.3%
D 8576
20.7%
S 6420
15.5%
Space Separator
ValueCountFrequency (%)
40479
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 492918
92.4%
Common 40479
 
7.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 128452
26.1%
r 81999
16.6%
g 59293
12.0%
a 37702
 
7.6%
i 32574
 
6.6%
s 21221
 
4.3%
n 17773
 
3.6%
t 17773
 
3.6%
o 17773
 
3.6%
A 15171
 
3.1%
Other values (7) 63187
12.8%
Common
ValueCountFrequency (%)
40479
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 533397
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 128452
24.1%
r 81999
15.4%
g 59293
11.1%
40479
 
7.6%
a 37702
 
7.1%
i 32574
 
6.1%
s 21221
 
4.0%
n 17773
 
3.3%
t 17773
 
3.3%
o 17773
 
3.3%
Other values (8) 78358
14.7%

Knowledge_7
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47717
Missing (%): 53.5%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 12.45286613
Min length: 5

Characters and Unicode

Total characters: 516383
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Agree
2nd row: Agree
3rd row: Agree
4th row: Strongly agree
5th row: Disagree

Value | Count | Frequency (%)
agree | 32098 | 40.5%
disagree | 19307 | 24.3%
neither | 9938 | 12.5%
nor | 9938 | 12.5%
strongly | 8027 | 10.1%

Most occurring characters

ValueCountFrequency (%)
e 122686
23.8%
r 79308
15.4%
g 59432
11.5%
37841
 
7.3%
a 35746
 
6.9%
i 29245
 
5.7%
s 19307
 
3.7%
n 17965
 
3.5%
t 17965
 
3.5%
o 17965
 
3.5%
Other values (8) 78923
15.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 437075
84.6%
Uppercase Letter 41467
 
8.0%
Space Separator 37841
 
7.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 122686
28.1%
r 79308
18.1%
g 59432
13.6%
a 35746
 
8.2%
i 29245
 
6.7%
s 19307
 
4.4%
n 17965
 
4.1%
t 17965
 
4.1%
o 17965
 
4.1%
d 11464
 
2.6%
Other values (3) 25992
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 15659
37.8%
N 9938
24.0%
S 8027
19.4%
D 7843
18.9%
Space Separator
ValueCountFrequency (%)
37841
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 478542
92.7%
Common 37841
 
7.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 122686
25.6%
r 79308
16.6%
g 59432
12.4%
a 35746
 
7.5%
i 29245
 
6.1%
s 19307
 
4.0%
n 17965
 
3.8%
t 17965
 
3.8%
o 17965
 
3.8%
A 15659
 
3.3%
Other values (7) 63264
13.2%
Common
ValueCountFrequency (%)
37841
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 516383
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 122686
23.8%
r 79308
15.4%
g 59432
11.5%
37841
 
7.3%
a 35746
 
6.9%
i 29245
 
5.7%
s 19307
 
3.7%
n 17965
 
3.5%
t 17965
 
3.5%
o 17965
 
3.5%
Other values (8) 78923
15.3%

Knowledge_8
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47780
Missing (%): 53.6%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 17
Mean length: 12.76323544
Min length: 5

Characters and Unicode

Total characters: 528449
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Strongly agree
2nd row: Agree
3rd row: Agree
4th row: Agree
5th row: Disagree

Value | Count | Frequency (%)
agree | 31994 | 39.4%
disagree | 20368 | 25.1%
neither | 10958 | 13.5%
nor | 10958 | 13.5%
strongly | 7009 | 8.6%

Most occurring characters

ValueCountFrequency (%)
e 126640
24.0%
r 81287
15.4%
g 59371
11.2%
39883
 
7.5%
a 35932
 
6.8%
i 31326
 
5.9%
s 20368
 
3.9%
n 17967
 
3.4%
t 17967
 
3.4%
o 17967
 
3.4%
Other values (8) 79741
15.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 447162
84.6%
Uppercase Letter 41404
 
7.8%
Space Separator 39883
 
7.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 126640
28.3%
r 81287
18.2%
g 59371
13.3%
a 35932
 
8.0%
i 31326
 
7.0%
s 20368
 
4.6%
n 17967
 
4.0%
t 17967
 
4.0%
o 17967
 
4.0%
d 13361
 
3.0%
Other values (3) 24976
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
A 16430
39.7%
N 10958
26.5%
S 7009
16.9%
D 7007
16.9%
Space Separator
ValueCountFrequency (%)
39883
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 488566
92.5%
Common 39883
 
7.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 126640
25.9%
r 81287
16.6%
g 59371
12.2%
a 35932
 
7.4%
i 31326
 
6.4%
s 20368
 
4.2%
n 17967
 
3.7%
t 17967
 
3.7%
o 17967
 
3.7%
A 16430
 
3.4%
Other values (7) 63311
13.0%
Common
ValueCountFrequency (%)
39883
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 528449
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 126640
24.0%
r 81287
15.4%
g 59371
11.2%
39883
 
7.5%
a 35932
 
6.8%
i 31326
 
5.9%
s 20368
 
3.9%
n 17967
 
3.4%
t 17967
 
3.4%
o 17967
 
3.4%
Other values (8) 79741
15.1%

Frequency_1
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47268
Missing (%): 53.0%
Memory size: 1.4 MiB

Length

Max length: 17
Median length: 16
Mean length: 13.21037313
Min length: 5

Characters and Unicode

Total characters: 553726
Distinct characters: 20
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1-2 times a week
2nd row: 6-10 times a week
3rd row: 1-2 times a week
4th row: 1-2 times a week
5th row: 1-2 times a week

Value | Count | Frequency (%)
times | 31209 | 23.0%
a | 31209 | 23.0%
week | 31209 | 23.0%
1-2 | 25528 | 18.8%
never | 10707 | 7.9%
3-5 | 4100 | 3.0%
6-10 | 847 | 0.6%
10 | 734 | 0.5%

Most occurring characters

ValueCountFrequency (%)
e 115041
20.8%
93627
16.9%
w 31209
 
5.6%
k 31209
 
5.6%
t 31209
 
5.6%
i 31209
 
5.6%
m 31209
 
5.6%
s 31209
 
5.6%
a 31209
 
5.6%
- 30475
 
5.5%
Other values (10) 96120
17.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 354918
64.1%
Space Separator 93627
 
16.9%
Decimal Number 63265
 
11.4%
Dash Punctuation 30475
 
5.5%
Uppercase Letter 10707
 
1.9%
Math Symbol 734
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 115041
32.4%
w 31209
 
8.8%
k 31209
 
8.8%
t 31209
 
8.8%
i 31209
 
8.8%
m 31209
 
8.8%
s 31209
 
8.8%
a 31209
 
8.8%
v 10707
 
3.0%
r 10707
 
3.0%
Decimal Number
ValueCountFrequency (%)
1 27109
42.8%
2 25528
40.4%
3 4100
 
6.5%
5 4100
 
6.5%
0 1581
 
2.5%
6 847
 
1.3%
Space Separator
ValueCountFrequency (%)
93627
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30475
100.0%
Uppercase Letter
ValueCountFrequency (%)
N 10707
100.0%
Math Symbol
ValueCountFrequency (%)
+ 734
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 365625
66.0%
Common 188101
34.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 115041
31.5%
w 31209
 
8.5%
k 31209
 
8.5%
t 31209
 
8.5%
i 31209
 
8.5%
m 31209
 
8.5%
s 31209
 
8.5%
a 31209
 
8.5%
N 10707
 
2.9%
v 10707
 
2.9%
Common
ValueCountFrequency (%)
93627
49.8%
- 30475
 
16.2%
1 27109
 
14.4%
2 25528
 
13.6%
3 4100
 
2.2%
5 4100
 
2.2%
0 1581
 
0.8%
6 847
 
0.5%
+ 734
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 553726
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 115041
20.8%
93627
16.9%
w 31209
 
5.6%
k 31209
 
5.6%
t 31209
 
5.6%
i 31209
 
5.6%
m 31209
 
5.6%
s 31209
 
5.6%
a 31209
 
5.6%
- 30475
 
5.5%
Other values (10) 96120
17.4%

Frequency_2
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 47259
Missing (%): 53.0%
Memory size: 1.4 MiB

Length

Max length: 17
Median length: 16
Mean length: 15.05268933
Min length: 5

Characters and Unicode

Total characters: 631084
Distinct characters: 20
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 10+ times a week
2nd row: 6-10 times a week
3rd row: 10+ times a week
4th row: 1-2 times a week
5th row: 1-2 times a week

Value | Count | Frequency (%)
times | 37916 | 24.4%
a | 37916 | 24.4%
week | 37916 | 24.4%
1-2 | 18930 | 12.2%
3-5 | 9809 | 6.3%
10 | 4794 | 3.1%
6-10 | 4383 | 2.8%
never | 4009 | 2.6%

Most occurring characters

ValueCountFrequency (%)
e 121766
19.3%
113748
18.0%
w 37916
 
6.0%
k 37916
 
6.0%
t 37916
 
6.0%
i 37916
 
6.0%
m 37916
 
6.0%
s 37916
 
6.0%
a 37916
 
6.0%
- 33122
 
5.2%
Other values (10) 97036
15.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 395196
62.6%
Space Separator 113748
 
18.0%
Decimal Number 80215
 
12.7%
Dash Punctuation 33122
 
5.2%
Math Symbol 4794
 
0.8%
Uppercase Letter 4009
 
0.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 121766
30.8%
w 37916
 
9.6%
k 37916
 
9.6%
t 37916
 
9.6%
i 37916
 
9.6%
m 37916
 
9.6%
s 37916
 
9.6%
a 37916
 
9.6%
v 4009
 
1.0%
r 4009
 
1.0%
Decimal Number
ValueCountFrequency (%)
1 28107
35.0%
2 18930
23.6%
3 9809
 
12.2%
5 9809
 
12.2%
0 9177
 
11.4%
6 4383
 
5.5%
Space Separator
ValueCountFrequency (%)
113748
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33122
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4794
100.0%
Uppercase Letter
ValueCountFrequency (%)
N 4009
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 399205
63.3%
Common 231879
36.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 121766
30.5%
w 37916
 
9.5%
k 37916
 
9.5%
t 37916
 
9.5%
i 37916
 
9.5%
m 37916
 
9.5%
s 37916
 
9.5%
a 37916
 
9.5%
N 4009
 
1.0%
v 4009
 
1.0%
Common
ValueCountFrequency (%)
113748
49.1%
- 33122
 
14.3%
1 28107
 
12.1%
2 18930
 
8.2%
3 9809
 
4.2%
5 9809
 
4.2%
0 9177
 
4.0%
+ 4794
 
2.1%
6 4383
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 631084
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 121766
19.3%
113748
18.0%
w 37916
 
6.0%
k 37916
 
6.0%
t 37916
 
6.0%
i 37916
 
6.0%
m 37916
 
6.0%
s 37916
 
6.0%
a 37916
 
6.0%
- 33122
 
5.2%
Other values (10) 97036
15.4%

Frequency_3
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 48130
Missing (%): 54.0%
Memory size: 1.4 MiB

Length

Max length: 17
Median length: 16
Mean length: 12.78723145
Min length: 5

Characters and Unicode

Total characters: 524967
Distinct characters: 20
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Never
2nd row: 3-5 times a week
3rd row: 1-2 times a week
4th row: 3-5 times a week
5th row: 3-5 times a week

Value | Count | Frequency (%)
times | 28947 | 22.6%
a | 28947 | 22.6%
week | 28947 | 22.6%
1-2 | 21470 | 16.8%
never | 12107 | 9.5%
3-5 | 5125 | 4.0%
6-10 | 1280 | 1.0%
10 | 1072 | 0.8%

Most occurring characters

ValueCountFrequency (%)
e 111055
21.2%
86841
16.5%
w 28947
 
5.5%
k 28947
 
5.5%
t 28947
 
5.5%
i 28947
 
5.5%
m 28947
 
5.5%
s 28947
 
5.5%
a 28947
 
5.5%
- 27875
 
5.3%
Other values (10) 96567
18.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 337898
64.4%
Space Separator 86841
 
16.5%
Decimal Number 59174
 
11.3%
Dash Punctuation 27875
 
5.3%
Uppercase Letter 12107
 
2.3%
Math Symbol 1072
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 111055
32.9%
w 28947
 
8.6%
k 28947
 
8.6%
t 28947
 
8.6%
i 28947
 
8.6%
m 28947
 
8.6%
s 28947
 
8.6%
a 28947
 
8.6%
v 12107
 
3.6%
r 12107
 
3.6%
Decimal Number
ValueCountFrequency (%)
1 23822
40.3%
2 21470
36.3%
3 5125
 
8.7%
5 5125
 
8.7%
0 2352
 
4.0%
6 1280
 
2.2%
Space Separator
ValueCountFrequency (%)
86841
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27875
100.0%
Uppercase Letter
ValueCountFrequency (%)
N 12107
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1072
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 350005
66.7%
Common 174962
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 111055
31.7%
w 28947
 
8.3%
k 28947
 
8.3%
t 28947
 
8.3%
i 28947
 
8.3%
m 28947
 
8.3%
s 28947
 
8.3%
a 28947
 
8.3%
N 12107
 
3.5%
v 12107
 
3.5%
Common
ValueCountFrequency (%)
86841
49.6%
- 27875
 
15.9%
1 23822
 
13.6%
2 21470
 
12.3%
3 5125
 
2.9%
5 5125
 
2.9%
0 2352
 
1.3%
6 1280
 
0.7%
+ 1072
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 524967
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 111055
21.2%
86841
16.5%
w 28947
 
5.5%
k 28947
 
5.5%
t 28947
 
5.5%
i 28947
 
5.5%
m 28947
 
5.5%
s 28947
 
5.5%
a 28947
 
5.5%
- 27875
 
5.3%
Other values (10) 96567
18.4%

TimeSearching
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 46406
Missing (%): 52.0%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 19
Mean length: 20.04224134
Min length: 19

Characters and Unicode

Total characters: 857367
Distinct characters: 23
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 15-30 minutes a day
2nd row: 30-60 minutes a day
3rd row: 15-30 minutes a day
4th row: 60-120 minutes a day
5th row: 30-60 minutes a day

Value | Count | Frequency (%)
minutes | 42778 | 23.5%
a | 42778 | 23.5%
day | 42778 | 23.5%
30-60 | 16338 | 9.0%
15-30 | 11773 | 6.5%
60-120 | 7626 | 4.2%
less | 3959 | 2.2%
than | 3959 | 2.2%
15 | 3959 | 2.2%
over | 3082 | 1.7%

Most occurring characters

ValueCountFrequency (%)
139334
16.3%
a 89515
 
10.4%
0 62783
 
7.3%
s 50696
 
5.9%
e 49819
 
5.8%
n 46737
 
5.5%
t 46737
 
5.5%
d 42778
 
5.0%
y 42778
 
5.0%
m 42778
 
5.0%
Other values (13) 243412
28.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 507517
59.2%
Decimal Number 167738
 
19.6%
Space Separator 139334
 
16.3%
Dash Punctuation 35737
 
4.2%
Uppercase Letter 7041
 
0.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 89515
17.6%
s 50696
10.0%
e 49819
9.8%
n 46737
9.2%
t 46737
9.2%
d 42778
8.4%
y 42778
8.4%
m 42778
8.4%
i 42778
8.4%
u 42778
8.4%
Other values (3) 10123
 
2.0%
Decimal Number
ValueCountFrequency (%)
0 62783
37.4%
3 28111
16.8%
1 26440
15.8%
6 23964
 
14.3%
5 15732
 
9.4%
2 10708
 
6.4%
Uppercase Letter
ValueCountFrequency (%)
L 3959
56.2%
O 3082
43.8%
Space Separator
ValueCountFrequency (%)
139334
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35737
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 514558
60.0%
Common 342809
40.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 89515
17.4%
s 50696
9.9%
e 49819
9.7%
n 46737
9.1%
t 46737
9.1%
d 42778
8.3%
y 42778
8.3%
m 42778
8.3%
i 42778
8.3%
u 42778
8.3%
Other values (5) 17164
 
3.3%
Common
ValueCountFrequency (%)
139334
40.6%
0 62783
18.3%
- 35737
 
10.4%
3 28111
 
8.2%
1 26440
 
7.7%
6 23964
 
7.0%
5 15732
 
4.6%
2 10708
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 857367
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
139334
16.3%
a 89515
 
10.4%
0 62783
 
7.3%
s 50696
 
5.9%
e 49819
 
5.8%
n 46737
 
5.5%
t 46737
 
5.5%
d 42778
 
5.0%
y 42778
 
5.0%
m 42778
 
5.0%
Other values (13) 243412
28.4%

TimeAnswering
Text

MISSING 

Distinct: 5
Distinct (%): < 0.1%
Missing: 46555
Missing (%): 52.2%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 19
Mean length: 20.63621009
Min length: 19

Characters and Unicode

Total characters: 879701
Distinct characters: 23
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 15-30 minutes a day
2nd row: 30-60 minutes a day
3rd row: 30-60 minutes a day
4th row: 30-60 minutes a day
5th row: 15-30 minutes a day

Value    Count  Frequency (%)
minutes  42629  22.5%
a        42629  22.5%
day      42629  22.5%
15-30    13678  7.2%
30-60    13013  6.9%
less     8321   4.4%
than     8321   4.4%
15       8321   4.4%
60-120   5674   3.0%
over     1943   1.0%

Most occurring characters

ValueCountFrequency (%)
146472
16.7%
a 93579
 
10.6%
s 59271
 
6.7%
0 52995
 
6.0%
e 52893
 
6.0%
n 50950
 
5.8%
t 50950
 
5.8%
y 42629
 
4.8%
m 42629
 
4.8%
i 42629
 
4.8%
Other values (13) 244704
27.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 532995
60.6%
Decimal Number 157605
 
17.9%
Space Separator 146472
 
16.7%
Dash Punctuation 32365
 
3.7%
Uppercase Letter 10264
 
1.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 93579
17.6%
s 59271
11.1%
e 52893
9.9%
n 50950
9.6%
t 50950
9.6%
y 42629
8.0%
m 42629
8.0%
i 42629
8.0%
u 42629
8.0%
d 42629
8.0%
Other values (3) 12207
 
2.3%
Decimal Number
ValueCountFrequency (%)
0 52995
33.6%
1 29616
18.8%
3 26691
16.9%
5 21999
14.0%
6 18687
 
11.9%
2 7617
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
L 8321
81.1%
O 1943
 
18.9%
Space Separator
ValueCountFrequency (%)
146472
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32365
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 543259
61.8%
Common 336442
38.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 93579
17.2%
s 59271
10.9%
e 52893
9.7%
n 50950
9.4%
t 50950
9.4%
y 42629
7.8%
m 42629
7.8%
i 42629
7.8%
u 42629
7.8%
d 42629
7.8%
Other values (5) 22471
 
4.1%
Common
ValueCountFrequency (%)
146472
43.5%
0 52995
 
15.8%
- 32365
 
9.6%
1 29616
 
8.8%
3 26691
 
7.9%
5 21999
 
6.5%
6 18687
 
5.6%
2 7617
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 879701
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
146472
16.7%
a 93579
 
10.6%
s 59271
 
6.7%
0 52995
 
6.0%
e 52893
 
6.0%
n 50950
 
5.8%
t 50950
 
5.8%
y 42629
 
4.8%
m 42629
 
4.8%
i 42629
 
4.8%
Other values (13) 244704
27.8%

ProfessionalTech
Text

MISSING 

Distinct: 284
Distinct (%): 0.7%
Missing: 47401
Missing (%): 53.1%
Memory size: 1.4 MiB

Length

Max length: 264
Median length: 215
Mean length: 114.3824043
Min length: 13

Characters and Unicode

Total characters: 4779240
Distinct characters: 33
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 0.1%

Sample

1st row: DevOps function;Microservices;Automated testing;Observability tools
2nd row: DevOps function;Microservices;Automated testing;Observability tools;Innersource initiative;Developer portal or other central places to find tools/services;Continuous integration (CI) and (more often) continuous delivery
3rd row: Automated testing;Continuous integration (CI) and (more often) continuous delivery
4th row: Microservices;Automated testing;Observability tools;Continuous integration (CI) and (more often) continuous delivery
5th row: DevOps function;Microservices;Observability tools;Continuous integration (CI) and (more often) continuous delivery
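
Each answer in this variable is a semicolon-separated list of selected options, so word-level counts split option names such as "Automated testing" into separate tokens. A minimal Python sketch of counting whole options instead, using only the five sample rows of this variable (the names `sample_rows` and `option_counts` are illustrative):

```python
from collections import Counter

# The five sample rows of this variable; each answer is a ";"-separated list.
sample_rows = [
    "DevOps function;Microservices;Automated testing;Observability tools",
    "DevOps function;Microservices;Automated testing;Observability tools;"
    "Innersource initiative;"
    "Developer portal or other central places to find tools/services;"
    "Continuous integration (CI) and (more often) continuous delivery",
    "Automated testing;"
    "Continuous integration (CI) and (more often) continuous delivery",
    "Microservices;Automated testing;Observability tools;"
    "Continuous integration (CI) and (more often) continuous delivery",
    "DevOps function;Microservices;Observability tools;"
    "Continuous integration (CI) and (more often) continuous delivery",
]

# Split every answer on ";" and count each selected option once per row.
option_counts = Counter(
    option for row in sample_rows for option in row.split(";")
)

for option, n in option_counts.most_common():
    print(n, option)
```

Applied to the full column, the same split would yield per-option frequencies rather than per-word frequencies.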
Value              Count   Frequency (%)
continuous         31128   6.6%
ci                 30056   6.4%
often              30056   6.4%
more               30056   6.4%
and                30056   6.4%
integration        30056   6.4%
devops             25258   5.4%
delivery           24529   5.2%
places             15335   3.3%
to                 15335   3.3%
Other values (63)  209048  44.4%

Most occurring characters

ValueCountFrequency (%)
o 463849
 
9.7%
e 452264
 
9.5%
429130
 
9.0%
t 423114
 
8.9%
n 376277
 
7.9%
i 335709
 
7.0%
s 283106
 
5.9%
r 251484
 
5.3%
u 176786
 
3.7%
a 160406
 
3.4%
Other values (23) 1427115
29.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3843896
80.4%
Space Separator 429130
 
9.0%
Uppercase Letter 242361
 
5.1%
Other Punctuation 124000
 
2.6%
Close Punctuation 66655
 
1.4%
Open Punctuation 66655
 
1.4%
Dash Punctuation 6543
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 463849
12.1%
e 452264
11.8%
t 423114
11.0%
n 376277
9.8%
i 335709
8.7%
s 283106
 
7.4%
r 251484
 
6.5%
u 176786
 
4.6%
a 160406
 
4.2%
c 154820
 
4.0%
Other values (10) 766081
19.9%
Uppercase Letter
ValueCountFrequency (%)
C 60112
24.8%
I 42505
17.5%
O 41700
17.2%
D 40593
16.7%
A 31941
13.2%
M 20526
 
8.5%
N 4984
 
2.1%
Other Punctuation
ValueCountFrequency (%)
; 108665
87.6%
/ 15335
 
12.4%
Space Separator
ValueCountFrequency (%)
429130
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66655
100.0%
Open Punctuation
ValueCountFrequency (%)
( 66655
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6543
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4086257
85.5%
Common 692983
 
14.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 463849
11.4%
e 452264
11.1%
t 423114
10.4%
n 376277
 
9.2%
i 335709
 
8.2%
s 283106
 
6.9%
r 251484
 
6.2%
u 176786
 
4.3%
a 160406
 
3.9%
c 154820
 
3.8%
Other values (17) 1008442
24.7%
Common
ValueCountFrequency (%)
429130
61.9%
; 108665
 
15.7%
) 66655
 
9.6%
( 66655
 
9.6%
/ 15335
 
2.2%
- 6543
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4779240
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 463849
 
9.7%
e 452264
 
9.5%
429130
 
9.0%
t 423114
 
8.9%
n 376277
 
7.9%
i 335709
 
7.0%
s 283106
 
5.9%
r 251484
 
5.3%
u 176786
 
3.7%
a 160406
 
3.4%
Other values (23) 1427115
29.9%

Industry
Text

MISSING 

Distinct: 12
Distinct (%): < 0.1%
Missing: 52410
Missing (%): 58.8%
Memory size: 1.4 MiB

Length

Max length: 67
Median length: 46
Mean length: 42.47955077
Min length: 5

Characters and Unicode

Total characters: 1562143
Distinct characters: 38
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Information Services, IT, Software Development, or other Technology
2nd row: Information Services, IT, Software Development, or other Technology
3rd row: Other
4th row: Other
5th row: Information Services, IT, Software Development, or other Technology

Value              Count  Frequency (%)
services           25531  13.6%
other              22170  11.8%
or                 20766  11.1%
information        18159  9.7%
it                 18159  9.7%
software           18159  9.7%
development        18159  9.7%
technology         18159  9.7%
financial          4421   2.4%
manufacturing      2607   1.4%
Other values (16)  21101  11.3%

Most occurring characters

ValueCountFrequency (%)
e 175682
 
11.2%
o 156474
 
10.0%
150617
 
9.6%
r 119512
 
7.7%
n 101865
 
6.5%
t 90667
 
5.8%
a 69156
 
4.4%
i 66640
 
4.3%
, 59691
 
3.8%
c 54883
 
3.5%
Other values (28) 516956
33.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1187165
76.0%
Uppercase Letter 164394
 
10.5%
Space Separator 150617
 
9.6%
Other Punctuation 59967
 
3.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 175682
14.8%
o 156474
13.2%
r 119512
10.1%
n 101865
 
8.6%
t 90667
 
7.6%
a 69156
 
5.8%
i 66640
 
5.6%
c 54883
 
4.6%
l 48371
 
4.1%
h 46578
 
3.9%
Other values (10) 257337
21.7%
Uppercase Letter
ValueCountFrequency (%)
S 46297
28.2%
T 38925
23.7%
I 37025
22.5%
D 18159
 
11.0%
C 4562
 
2.8%
F 4421
 
2.7%
O 4287
 
2.6%
H 3458
 
2.1%
M 2607
 
1.6%
R 1955
 
1.2%
Other values (5) 2698
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 59691
99.5%
& 276
 
0.5%
Space Separator
ValueCountFrequency (%)
150617
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1351559
86.5%
Common 210584
 
13.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 175682
13.0%
o 156474
 
11.6%
r 119512
 
8.8%
n 101865
 
7.5%
t 90667
 
6.7%
a 69156
 
5.1%
i 66640
 
4.9%
c 54883
 
4.1%
l 48371
 
3.6%
h 46578
 
3.4%
Other values (25) 421731
31.2%
Common
ValueCountFrequency (%)
150617
71.5%
, 59691
 
28.3%
& 276
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1562143
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 175682
 
11.2%
o 156474
 
10.0%
150617
 
9.6%
r 119512
 
7.7%
n 101865
 
6.5%
t 90667
 
5.8%
a 69156
 
4.4%
i 66640
 
4.3%
, 59691
 
3.8%
c 54883
 
3.5%
Other values (28) 516956
33.1%

SurveyLength
Text

MISSING 

Distinct: 3
Distinct (%): < 0.1%
Missing: 2699
Missing (%): 3.0%
Memory size: 1.4 MiB

Length

Max length: 21
Median length: 21
Mean length: 17.9372608
Min length: 8

Characters and Unicode

Total characters: 1551304
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Appropriate in length
2nd row: Appropriate in length
3rd row: Appropriate in length
4th row: Appropriate in length
5th row: Appropriate in length

Value        Count  Frequency (%)
appropriate  65962  27.6%
in           65962  27.6%
length       65962  27.6%
too          20523  8.6%
long         18605  7.8%
short        1918   0.8%

Most occurring characters

ValueCountFrequency (%)
p 197886
12.8%
152447
9.8%
n 150529
9.7%
r 133842
8.6%
t 133842
8.6%
i 131924
8.5%
e 131924
8.5%
o 127531
8.2%
l 84567
 
5.5%
g 84567
 
5.5%
Other values (5) 222245
14.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1312372
84.6%
Space Separator 152447
 
9.8%
Uppercase Letter 86485
 
5.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
p 197886
15.1%
n 150529
11.5%
r 133842
10.2%
t 133842
10.2%
i 131924
10.1%
e 131924
10.1%
o 127531
9.7%
l 84567
6.4%
g 84567
6.4%
h 67880
 
5.2%
Other values (2) 67880
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
A 65962
76.3%
T 20523
 
23.7%
Space Separator
ValueCountFrequency (%)
152447
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1398857
90.2%
Common 152447
 
9.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
p 197886
14.1%
n 150529
10.8%
r 133842
9.6%
t 133842
9.6%
i 131924
9.4%
e 131924
9.4%
o 127531
9.1%
l 84567
6.0%
g 84567
6.0%
h 67880
 
4.9%
Other values (4) 154365
11.0%
Common
ValueCountFrequency (%)
152447
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1551304
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
p 197886
12.8%
152447
9.8%
n 150529
9.7%
r 133842
8.6%
t 133842
8.6%
i 131924
8.5%
e 131924
8.5%
o 127531
8.2%
l 84567
 
5.5%
g 84567
 
5.5%
Other values (5) 222245
14.3%

SurveyEase
Text

MISSING 

Distinct: 3
Distinct (%): < 0.1%
Missing: 2630
Missing (%): 2.9%
Memory size: 1.4 MiB

Length

Max length: 26
Median length: 4
Mean length: 11.98121404
Min length: 4

Characters and Unicode

Total characters: 1037022
Distinct characters: 19
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Easy
2nd row: Easy
3rd row: Easy
4th row: Neither easy nor difficult
5th row: Neither easy nor difficult

Value      Count  Frequency (%)
easy       85180  47.4%
difficult  32462  18.1%
neither    31088  17.3%
nor        31088  17.3%

Most occurring characters

ValueCountFrequency (%)
i 96012
 
9.3%
e 93264
 
9.0%
93264
 
9.0%
a 85180
 
8.2%
s 85180
 
8.2%
y 85180
 
8.2%
f 64924
 
6.3%
t 63550
 
6.1%
r 62176
 
6.0%
E 54092
 
5.2%
Other values (9) 254200
24.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 857204
82.7%
Space Separator 93264
 
9.0%
Uppercase Letter 86554
 
8.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 96012
11.2%
e 93264
10.9%
a 85180
9.9%
s 85180
9.9%
y 85180
9.9%
f 64924
7.6%
t 63550
7.4%
r 62176
 
7.3%
c 32462
 
3.8%
u 32462
 
3.8%
Other values (5) 156814
18.3%
Uppercase Letter
ValueCountFrequency (%)
E 54092
62.5%
N 31088
35.9%
D 1374
 
1.6%
Space Separator
ValueCountFrequency (%)
93264
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 943758
91.0%
Common 93264
 
9.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 96012
10.2%
e 93264
9.9%
a 85180
 
9.0%
s 85180
 
9.0%
y 85180
 
9.0%
f 64924
 
6.9%
t 63550
 
6.7%
r 62176
 
6.6%
E 54092
 
5.7%
c 32462
 
3.4%
Other values (8) 221738
23.5%
Common
ValueCountFrequency (%)
93264
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1037022
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 96012
 
9.3%
e 93264
 
9.0%
93264
 
9.0%
a 85180
 
8.2%
s 85180
 
8.2%
y 85180
 
8.2%
f 64924
 
6.3%
t 63550
 
6.1%
r 62176
 
6.0%
E 54092
 
5.2%
Other values (9) 254200
24.5%

ConvertedCompYearly
Real number (ℝ)

MISSING  SKEWED 

Distinct: 8784
Distinct (%): 18.3%
Missing: 41165
Missing (%): 46.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 103110.0817
Minimum: 1
Maximum: 74351432
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 1
5-th percentile: 6502.6
Q1: 43907
Median: 74963
Q3: 121641
95-th percentile: 230000
Maximum: 74351432
Range: 74351431
Interquartile range (IQR): 77734

Descriptive statistics

Standard deviation: 681418.8387
Coefficient of variation (CV): 6.608653852
Kurtosis: 9673.712823
Mean: 103110.0817
Median Absolute Deviation (MAD): 36563
Skewness: 94.73829622
Sum: 4951243014
Variance: 4.643316338 × 10^11
Monotonicity: Not monotonic
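
Several of these statistics follow directly from one another. The Python sketch below is a minimal check using only numbers reported for ConvertedCompYearly; it assumes the usual definitions (IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared), which may differ in detail from what the report's generator uses internally.

```python
import math

# Values copied from the statistics reported for ConvertedCompYearly.
q1, q3 = 43907, 121641
mean = 103110.0817
std = 681418.8387

# Interquartile range: spread of the middle 50% of values.
iqr = q3 - q1

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

# Variance: square of the standard deviation.
variance = std ** 2

print(iqr)
print(round(cv, 6))
print(f"{variance:.6e}")

# Compare against the reported CV and variance within a loose tolerance.
print(math.isclose(cv, 6.608653852, rel_tol=1e-6))
print(math.isclose(variance, 4.643316338e11, rel_tol=1e-6))
```

A CV well above 1, together with a maximum far beyond the 95-th percentile, is consistent with the SKEWED alert on this variable.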
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
64254                784    0.9%
53545                615    0.7%
150000               585    0.7%
200000               558    0.6%
74963                557    0.6%
85672                532    0.6%
120000               484    0.5%
107090               464    0.5%
42836                420    0.5%
69608                400    0.4%
Other values (8774)  42620  47.8%
(Missing)            41165  46.2%

Value  Count  Frequency (%)
1      18     < 0.1%
2      12     < 0.1%
3      7      < 0.1%
4      10     < 0.1%
5      9      < 0.1%

Value     Count  Frequency (%)
74351432  1      < 0.1%
73607918  1      < 0.1%
72714292  1      < 0.1%
57513831  1      < 0.1%
36573181  1      < 0.1%